On a Monotone Dynamic Approach to Optimal Stopping Problems for Continuous-Time Markov Chains
Laurent Miclo (IMT, TSE), St\'ephane Villeneuve (TSE)

TL;DR
This paper introduces a new monotone dynamic method for numerically solving optimal stopping problems related to perpetual American options driven by continuous-time Markov chains, ensuring convergence to the value function.
Contribution
The paper presents a novel monotone sequence approach for pricing American options in Markov chain models, with minimal assumptions and proven convergence.
Findings
The method effectively computes option values.
Convergence of the sequence to the true value function is established.
Applicable under minimal assumptions about payoffs and Markov chains.
Abstract
This paper is concerned with the solution of the optimal stopping problem associated to the valuation of Perpetual American options driven by continuous time Markov chains. We introduce a new dynamic approach for the numerical pricing of this type of American options where the main idea is to build a monotone sequence of almost excessive functions that are associated to hitting times of explicit sets. Under minimal assumptions about the payoff and the Markov chain, we prove that the value function of an American option is characterized by the limit of this monotone sequence.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Advanced Queuing Theory Analysis · Capital Investment and Risk Analysis
On a Monotone Dynamic Approach to Optimal Stopping Problems for Continuous-Time Markov Chains
Laurent Miclo111CNRS and Toulouse School of Economics. and Stéphane Villeneuve222Toulouse School of Economics.
Abstract
This paper is concerned with the solution of the optimal stopping problem associated to the valuation of Perpetual American options driven by continuous time Markov chains. We introduce a new dynamic approach for the numerical pricing of this type of American options where the main idea is to build a monotone sequence of almost excessive functions that are associated to hitting times of explicit sets. Under minimal assumptions about the payoff and the Markov chain, we prove that the value function of an American option is characterized by the limit of this monotone sequence.
Keywords: Markov chains, Optimal Stopping, American option pricing
1 Introduction
Optimal stopping problems have received a lot of attention in the literature on stochastic control since the seminal work of Wald [16] about sequential analysis while the most recent application of optimal stopping problems have emerged from mathematical finance with the valuation of American options and the theory of real options, see e.g. [12] and [5]. The first general result of optimal stopping theory for stochastic processes was obtained in discrete time by Snell [14] who characterized the value function of an optimal stopping problem as the least excessive function that is a majorant of the reward. For a survey of optimal stopping theory for Markov processes, see the book by Shiryaev [13]. Theoretical and numerical aspects of the valuation of American options have been the subject of numerous articles in many different models including discrete-time Markov chains (see e.g. [3],[10]), time-homogenous diffusions (see e.g. [4]) and Lévy processes (see e.g. [11]) . Following the recent study by Eriksson and Pistorius [erikssonpistorius15], this paper is concerned with optimal stopping problems in the setting of a continuous-time Markov chain. This class of processes, which contains the classic birth-death process, have recently been introduced in finance to model the state of the order book, see [1]. Assuming a uniform integrability condition for the payoff function, Eriksson and Pistorius [7] have shown that the value of an optimal stopping problem for a continuous-time Markov chain can be characterized as the unique solution to a system of variational inequalities. Furthermore, when the state space of the underlying Markov chain is a subset of and when the stopping region is assumed to be an interval, their paper provides an algorithm to compute the value function.
Our approach is different and relies on a monotone recursive construction of both the value function and the stopping region along a sequence of almost excessive functions build along the hitting times of explicit sets. Using the Snell characterization of the value function as the smallest excessive majorant of the payoff function has been already the idea of the two papers [9] and [2] in the one-dimensional diffusion case where upper bounds of the value function are build using linear programming. The main advantage of the monotone approach developed here, is that it converges to the value with minimal assumptions about the continuous-time Markov chain and the payoff function. In particular, we abandon the uniform integrability condition while the state space is not necessary a subset of the set of real numbers. Such an approach gives a constructive method of finding the value function and seems to be designed for computational methods. It is fair to notice however, that this procedure may give the exact value of the value function only after infinite number of steps. A practical exception is given when considering the case of Markov chains with finite number of states where the resulting algorithm resembles the elimination algorithm proposed in [15] and thus converges in a finite number of steps.
2 Formulation of the problem
On a countable state space endowed with the discrete topology, we consider a Markov generator , that is an infinite matrix whose entries are real numbers satisfying
[TABLE]
We define and assume that for every .
For any probability measure on , let us associate to a Markov process defined on some probability space whose initial distribution is . First we set and is sampled according to . Then we consider an exponential random variable of parameter . If , we have a.s. and we take for all , as well as for all . If , we take for all and we sample on according to the probability distribution . Next, still in the case where , we sample an inter-time as an exponential distribution of parameter . If , we have a.s. and we take for all , as well as for all . If , we take for all and we sample on according to the probability distribution . We keep on following the same procedure, where all the ingredients are independent, except for the explicitly mentioned dependences.
In particular, we get a non-decreasing family of jump times taking values in . Denote the corresponding exploding time
[TABLE]
When , we must still define for . So introduce a cemetery point not belonging to and denote . is seen as the Alexandrov compactification of . We take for all to get a -valued Markov process . Let be the completed right-continuous filtration generated by and let (resp.) be the set of functions defined on taking values in (resp. ). The generator acts on via
[TABLE]
We would like to extend this action on , but since its elements are allowed to take the value , it leads to artificial conventions such as . The only reasonable convention is , so let us introduce , the infinite matrix whose diagonal entries are zero and which is coinciding with outside the diagonal. Its interest is that acts obviously on through
[TABLE]
In this paper, we will consider an optimal stopping problem with payoff , where and , given by
[TABLE]
where is a set of -adapted stopping times and where the in index of the expectation indicates that starts from . A stopping time is said to be optimal for if
[TABLE]
Observe that with our convention, we have on the set .
There are two questions to be solved in connection with Definition (2.2). The first question is to value the function while the second is to find an optimal stopping time . Note that optimal stopping times may not exist (see [13] Example 5 p.61) . According to the general optimal stopping theory, an optimal stopping time, if it exists, is related to the set
[TABLE]
called the stopping region. In particular, when satisfies the uniform integrability condition
[TABLE]
the stopping time is optimal if for all (see Shiryaev [13] Theorem 4 p.52).
The main objective of this paper is to provide a recursive construction of both the value function and the stopping region without assuming the uniform integrability condition. However, to present the idea of the monotone dynamic approach developed in this paper, Section 3 first consider the case of a finite state space for which the uniform integrability condition is obviously satisfied. Section 4 is devoted to the general case. Section 5 revisits an example of optimal stopping with random intervention times.
3 Finite state space
On a finite set , the payoff function is bounded and thus the value function defined by (2.2) is well-defined for every . Moreover, it is well-known (see [13], Theorem 3) that the value function is the minimal -excessive function which dominates . Recall that a function is -excessive if . Moreover, on the set , satisfies . Because of the finiteness of , the process
[TABLE]
is a -martingale under for every function defined on and every which yields by taking expectations, the so-called Dynkin’s formula.
We first establish some properties of the stopping region . Let us introduce the set
[TABLE]
and assume that for some . We recall that a Markov process is said to be irreducible if for all where
[TABLE]
Lemma 1**.**
We have the inclusion and when we assume furthermore that is irreducible, we have .
Proof.
Because is -excessive, we have for all ,
[TABLE]
Therefore, .
For the second inclusion, let be the first time hits . We have for all , and every ,
[TABLE]
Letting tend to , we obtain because is bounded on the finite sate space
[TABLE]
where the last strict inequality follows from the fact that is irreducible.
∎
Now, we introduce as the value associated to the stopping strategy Stop the first time enters in . Formally, let us define
[TABLE]
and
[TABLE]
Clearly by Definition (2.2). Moreover, we have on .
Lemma 2**.**
We have
- •
, and .
- •
, .
Proof.
Let . Applying the Optional Sampling theorem to the bounded martingale
[TABLE]
we have,
[TABLE]
because for . Moreover, for , almost surely. Thus, the Strong Markov property yields
[TABLE]
from which we deduce .
Because is -excessive, we have for all ,
[TABLE]
∎
To start the recursive construction, we introduce the set
[TABLE]
and the function
[TABLE]
where
[TABLE]
Observe that if , is a -excessive majorant of and therefore . Because the reverse inequality holds by definition, the procedure stops.
By induction, we shall define a sequence for starting from by
[TABLE]
and
[TABLE]
where
[TABLE]
Next lemma proves a key monotonicity result.
Lemma 3**.**
We have and , .
Proof.
To start the induction, we assume using Lemma 2 that satisfies
[TABLE]
For , we have . On the other hand, for , we have
[TABLE]
because outside .
Let . If , we have and thus . Now, let and let us define
[TABLE]
Clearly, . Therefore by the Strong Markov property,
[TABLE]
because on the set . ∎
According to Lemma 3, the sequence is increasing and satisfies with strict inequality outside , while by construction, the sequence is decreasing. It follows that we can define a function on by
[TABLE]
and a set by
[TABLE]
We are in a position to state our first result.
Theorem 4**.**
We have and .
Proof.
By definition, for every and thus passing to the limit, we have . To show the reverse inequality, we first notice that for every , we have and thus .
If then for every and thus for every . Passing to the limit, we obtain
[TABLE]
If then there is some such that for . Thus, for such a , we have
[TABLE]
Passing to the limit, we obtain
[TABLE]
To conclude, we observe that because for every , we have , we have for every stopping time
[TABLE]
from which we deduce that
[TABLE]
because . Taking the supremum over at the right-hand side of (3.3), we obtain .
Equality implies that . To show the reverse inclusion, let which means that for larger than some . Lemma 3 yields that for and because is increasing, we deduce that for which concludes the proof.∎
Remark 5**.**
Because is finite, the sequence is constant after some and therefore the procedure stops after at most steps.
Example 6**.**
Let be a birth-death process on the set of integers stopped the first time it hits or . We define for ,
[TABLE]
*and . We define as the reward function.
Clearly, and thus the stopping region contains the extreme points . We define*
[TABLE]
A direct computation shows that for , and for . Therefore,
[TABLE]
with , where is the least integer greater than or equal to . In particular, when , we have and thus . Assume now that . To start the induction, we define
[TABLE]
and
[TABLE]
and we construct by solving for , the linear equation
[TABLE]
The function is thus explicit and denoting
[TABLE]
[TABLE]
we have
[TABLE]
Observe that and thus . As a consequence, does not belong to the set
[TABLE]
Therefore, if , we have
[TABLE]
or, if
[TABLE]
Following our recursive procedure, after steps, we shall have eliminated the negative integers and thus obtain
[TABLE]
for some . Note that for , we have
[TABLE]
If , the stopping region coincides with , else we define
[TABLE]
and
[TABLE]
and we repeat the procedure.
4 General state space
4.1 Countable State Space
When considering countable finite state space, Dynkin’s formula that has been used in the proofs of Lemma 2 and 3 is not directly available, because nothing prevents the payoff to take arbitrarily large values. Nevertheless, we will adapt the strategy used in the case of a finite state space to build a monotone dynamic approach of the value function in the case of a countable finite state space.
Hereafter, we set some payoff function and . We will construct a subset and a function by the following recursive algorithm.
We begin by taking and . Next, let us assume that and have been built for some such that
[TABLE]
Observe that it is trivially true for . Then, we define the subset as follows
[TABLE]
where the inequality is understood in .
Next, we consider the stopping time
[TABLE]
with the usual convention that . For , define furthermore the stopping time
[TABLE]
and the function given by
[TABLE]
Remark 7**.**
The non-negative random variable is well-defined, even if , since the convention imposes that whatever would be , which is not defined in this case. The occurrence of should be quite exceptional: we have
[TABLE]
in particular it never happens if for all , i.e. when is the only possible absorbing point for .
Our first result shows that the sequence is non-decreasing.
Lemma 8**.**
We have
[TABLE]
Proof.
We first compute
[TABLE]
Note that on the event , we have that , so the first term in the above r.h.s. is equal to
[TABLE]
On the event , we have that , where , for , is the shift operator by time on the underlying canonical probability space of cà dlà g trajectories. Using the Strong Markov property of , we get that
[TABLE]
For , consider two situations:
- •
if , we have a.s. and as in Remark 7, we get
[TABLE]
- •
if , we compute, in ,
[TABLE]
By our conventions, the equality
[TABLE]
is then true for all .
For , due to (4.1), the r.h.s. is equal to . For , by definition of in (4.3), the r.h.s. of (4.7) is bounded below by . It follows that for any ,
[TABLE]
On the event , we have and thus
[TABLE]
Coming back to (4.6), we deduce that
[TABLE]
and taking into account (4.5), we conclude that
[TABLE]
∎
The monotonicity property of Lemma 8 enables us to define the function via
[TABLE]
ending the iterative construction of the pair from . It remains to check that:
Lemma 9**.**
The assertion (4.1) is satisfied with replaced by .
Proof.
Consider for which , a. s.. For the Markov process starting from , we have for any ,
[TABLE]
The Strong Markov property of then implies that
[TABLE]
by resorting again to the computations of the proof of Lemma 8. Monotone convergence insures that
[TABLE]
so we get that for ,
[TABLE]
as wanted.∎
The sequence is non-increasing by definition, as a consequence we can define
[TABLE]
From Lemma 8, we deduce that for any ,
[TABLE]
It follows that we can define the function as the non-decreasing limit
[TABLE]
The next two propositions establish noticeable properties of the pair :
Proposition 10**.**
We have:
[TABLE]
Proof.
Since , the fact that is a non-decreasing sequence implies that . To show there is an equality on , it is sufficient to show that
[TABLE]
This is proven by an iterative argument on . For , it corresponds to the equality . Assume that on , for some . For , we have and thus for any , we get . From (4.4), we deduce that
[TABLE]
Letting go to infinity, it yields that on .
Consider . There exists such that for any , we have . Then passing at the limit for large in (4.1), we get, via another use of monotone convergence, that
[TABLE]
For , we have for any and thus from (4.3), we have . Letting go to infinity, we deduce that
[TABLE]
∎
In fact, is a strict majorant of on as proved in the following
Proposition 11**.**
We have
[TABLE]
It follows that
[TABLE]
Proof.
Consider , there exists a first integer such that and . From (4.1) and , we deduce that
[TABLE]
From (4.3) and , we get
[TABLE]
Putting together these two inequalities and the fact that , we end up with
[TABLE]
which implies that
[TABLE]
namely .
This argument shows that
[TABLE]
The reverse inclusion is deduced from Proposition 10. ∎
Another formulation of the functions , for , will be very useful for the characterization of their limit . For , let us modify Definition (4.4) to define a function as
[TABLE]
A priori there is no monotonicity with respect to , so we define
[TABLE]
A key observation is:
Lemma 12**.**
For any , we have .
Proof.
Since for any , we have , we get from a direct comparison between (4.4) and (4.10) that for any , , so letting go to infinity, we deduce that
[TABLE]
The reverse inequality is proven by an iteration over .
More precisely, since , we get by definition that .
Assume that the equality is true for some , and let us show that . For any , we have
[TABLE]
where we used Fatou’s lemma. From (4.10) and the Strong Markov property, we deduce that
[TABLE]
It follows that
[TABLE]
It remains to let go to infinity to get and , taking into account (4.11). ∎
Let be the set of -valued stopping times with respect to the filtration generated by . For and , we define
[TABLE]
Extending the observation of Remark 7, it appears that for any , the quantity
[TABLE]
is well-defined in . It is non-decreasing with respect to , since for any and any , can be written as , with . Thus we can define a function by
[TABLE]
By definition of the value function given by (2.2), we have for every and thus for every . To show the reverse inequality, consider any stopping time and apply Fatou Lemma to get
[TABLE]
Therefore, the value function coincides with the limit of the sequence . At this stage, we recall the definition of the stopping region
[TABLE]
We are in a position to state our main result
Theorem 13**.**
We have
[TABLE]
Proof.
It is sufficient to show that , since will then follow from Proposition 11 and (4.13).
We begin by proving the inequality . Fix some . By considering in (4.12) the stopping time defined in (4.1), we get for any given ,
[TABLE]
considered in (4.10). Taking Lemma 12 into account, we deduce that
[TABLE]
and letting go to infinity, we get . It remains to let go to infinity to show that .
To prove the reverse inequality , we will show by induction that for every , every and every , we have
[TABLE]
For , we have, because on the set ,
[TABLE]
Focusing on the third term, we observe, that on the set , we have where is an exponential random variable with parameter independent of . Therefore, the Strong Markov property yields
[TABLE]
Hence,
[TABLE]
where the last inequality follows from Proposition 10. This proves the assertion for . Assume now that for every and every , we have
[TABLE]
Observing that , we get
[TABLE]
which ends the argument by induction. To conclude, we take the limit at the right-hand side of inequality (4.14) to obtain for every . ∎
Remark 14**.**
Because we have financial applications in mind, we choose to work directly with payoffs of the form . Observe, however, that our methodology applies when pending the assumption on the set .
We close this section by giving a very simple example on the countable state space with a bounded reward function such that the recursive algorithm does not stop in finite time because it eliminates only one point at each step.
Example 15**.**
Let be a birth-death process with the generator on
[TABLE]
We define the reward function as
[TABLE]
We assume and . Therefore, . It is easy to show that
[TABLE]
Therefore, . At each step , because , the algorithm will remove only the integer in the set . Therefore, it will not reach the stopping region in finite time.
4.2 Measurable state space
Up to know, we have only considered continuous-time Markov chains with a discrete state space. But, it is not difficult to see that the results of the previous section can be extended to the case where the state space of the Markov chain is a measurable space. More formally, we consider on a measurable state space , a non-negative finite kernel . It is a mapping
[TABLE]
such that
- •
for any , is a non-negative finite measure on (because ),
- •
for any , is a non-negative measurable function on .
For any probability measure on , let us associate to a continuous-time Markov process whose initial distribution is . First we set and is sampled according to . Then we consider an exponential random variable of parameter . If , we have a.s. and we take for all , as well as for all . If , we take for all and we sample on according to the probability distribution . Next, still in the case where , we sample an inter-time as an exponential distribution of parameter . If , we have a.s. and we take for all , as well as for all . If , we take for all and we sample on according to the probability distribution . We keep on following the same procedure, where all the ingredients are independent, except for the explicitly mentioned dependences.
In particular, we get a non-decreasing family of jump times taking values in . Denote the corresponding exploding time
[TABLE]
When , we must still define for . So introduce a cemetery point not belonging to and denote . We take for all to get a -valued process .
Let be the space of bounded and measurable functions from to . For , the infinitesimal generator of is given by
[TABLE]
As in Section 4.1, we set some payoff function and . We will construct a subset and a function by our recursive algorithm as follows:
We begin by taking and . Next, let us assume that and have been built for some such that
[TABLE]
Observe that it is trivially true for . Then, we define the subset as follows
[TABLE]
where the inequality is understood in .
Next, we consider the stopping time
[TABLE]
with the usual convention that . It is easy to check that the proofs of Section 4.1. are directly deduced.
Remark 16**.**
Our methodology also applies for discrete Markov chains according to the Poissonization technique that we recall briefly. Consider a Poisson process of intensity and a discrete Markov chain with transition matrix or kernel . Assume that and are independent. Then, the process
[TABLE]
is a continous-time Markov chain with generator .
5 Application: optimal stopping with random intervention times
We revisit the paper by Dupuis and Wang [6] where they consider a class of optimal stopping problems that can be only stopped at Poisson jump times. Consider a probability space satisfying the usual conditions. For , let be a geometric Brownian motion solving the stochastic differential equation
[TABLE]
where is a standard -Brownian motion and and are constants. When , we take for all times . The probability space is rich enough to carry a -Poisson process with intensity that is assumed to be independent from . The jump times of the Poisson process are denoted by with
In [6], the following optimal stopping problem is considered
[TABLE]
where and is the set of -adapted stopping time for which for some . Similarly to [6], let us define and the -Markov chain to have
[TABLE]
and is the set of -stopping time with values in . To enter the continuous-time framework of the previous sections, we use Remark 16 with an independent Poisson process with intensity . To start our recursive approach, we need to compute the infinitesimal generator of the continuous Markov chain with state space in order to define
[TABLE]
Let be a bounded and measurable function on . According to Remark 16, we have,
[TABLE]
Because with , we have
[TABLE]
where
[TABLE]
is the resolvent of the continuous Markov process . Therefore, we have with
[TABLE]
First, we observe that is an interval . Indeed, let us define
[TABLE]
Clearly, for . Moreover, for ,
[TABLE]
It is well-known that for any and thus, because ,
[TABLE]
which gives that is a decreasing function on . It follows that if is not empty, then it will be an interval of the form . Now,
[TABLE]
Because , we have and therefore, for which proves that is not empty.
We will now prove by induction that for every , with and . Assume that it is true for some . Following our monotone procedure with Remark 14, we define the solution of the equation
[TABLE]
and define
[TABLE]
Let us check that with and .
To do this, we look for a function of the form
[TABLE]
We end up with the following equation on :
[TABLE]
or equivalently, (see [8], Proposition 2.1 page 10)
[TABLE]
where is the infinitesimal generator of , that is, acting on any via
[TABLE]
With this formulation we see that is given by
[TABLE]
where is the first hitting time of by our induction hypothesis.
By definition, we have
[TABLE]
thus where
[TABLE]
with
[TABLE]
To prove that is of the form , we begin by showing that
[TABLE]
To do so, introduce the hitting time
[TABLE]
Recall that the solution of (5) is given by
[TABLE]
It follows that
[TABLE]
In particular takes the value with positive probability when , but otherwise is a.s. finite. Nevertheless, taking into account that for any , we have , we can always write for :
[TABLE]
where we use the strong Markov property with the stopping time . Reversing the same argument, with replaced by , we deduce that
[TABLE]
Thus, we have
[TABLE]
In the first part of the above proof, to get the existence of , we have shown that the mapping is non-increasing on , and in particular
[TABLE]
so we get
[TABLE]
Assume now that . It means that
[TABLE]
implying that
[TABLE]
since so that .
This proves (5.7) and ends the induction argument.
According to Proposition 10 and Theorem 13 (more precisely its extension given in Subsection 4.2), the value function and the stopping set satisfy , where
[TABLE]
[TABLE]
and
[TABLE]
The stopping set is an interval that may be empty if is not finite. Using again [8], Proposition 2.1, we obtain
[TABLE]
Therefore, the function given by for any also satisfies
[TABLE]
Moreover,
[TABLE]
which yields
[TABLE]
This corresponds to the variational inequality (3.4)-(3-9) page 6 solved in [6], establishing that is non-empty.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Abergel, F. and Jedidi, A.: A Mathematical Approach to Order book Modeling, In Econophysics of Order-driven Markets , pp. 93-107.
- 2[2] Christensen, S.: A method for pricing American options using semi-infinite linear programming, Mathematical Finance , Vol 24, pp. 156-172, (2014).
- 3[3] Cox J.C, Ross S. and Rubinstein M.: Option pricing: A simplified approach, Journal of Financial economics , Vol 7, pp.229-262, (1979).
- 4[4] Dayanik S. and Karatzas, I.: On the optimal stopping problem for one-dimensional diffusions, Stochastic processes and their Applications , 107,pp. 173-212, (2003).
- 5[5] Dixit, A.K. and Pindyck, R.S. Investment under Uncertainty , Princeton University Press, (1994).
- 6[6] Dupuis P. and Wang H.: Optimal stopping with random intervention times, Advances in Applied Probability , Vol 34, pp. 141-157,(2002).
- 7[7] Eriksson B. and Pistorius M.R.: American Option Valuation under Continuous-Time Markov Chains, Advances in Applied Probability , Vol 47, 2, pp. 378-401 (2015).
- 8[8] Ethier, S.N. and Kurtz, T.G.: Markov Processes: Characterization and Convergence , Wiley Series in Probability and Statistics, (2005)
