Optimal Reduction of Public Debt under Partial Observation of the Economic Growth
Giorgia Callegaro, Claudia Ceci, Giorgio Ferrari

TL;DR
This paper models optimal public debt reduction considering partial economic information, using stochastic control and filtering techniques, and provides explicit policies in a two-regime economic setting.
Contribution
It formulates a novel partial observation control model with regime-switching and derives explicit optimal policies in a simplified two-regime case.
Findings
Reduction of the complex control problem to a full information problem via filtering.
Explicit characterization of optimal debt reduction policy in a two-regime economy.
Development of a free boundary in the associated optimal stopping problem.
Abstract
We consider a government that aims at reducing the debt-to-gross domestic product (GDP) ratio of a country. The government observes the level of the debt-to-GDP ratio and an indicator of the state of the economy, but does not directly observe the development of the underlying macroeconomic conditions. The government's criterion is to minimize the sum of the total expected costs of holding debt and of debt's reduction policies. We model this problem as a singular stochastic control problem under partial observation. The contribution of the paper is twofold. Firstly, we provide a general formulation of the model in which the level of debt-to-GDP ratio and the value of the macroeconomic indicator evolve as a diffusion and a jump-diffusion, respectively, with coefficients depending on the regimes of the economy. These are described through a finite-state continuous-time Markov chain. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal Reduction of Public Debt under Partial Observation of the Economic Growth
Giorgia Callegaro, Claudia Ceci, and Giorgio Ferrari
G. Callegaro: Department of Mathematics “Tullio Levi-Civita”, University of Padova, Via Trieste, 35121 Padova, Italy
C. Ceci: Department of Economics, University “G. D’annunzio” of Chieti-Pescara, Viale Pindaro 42, I-65127 Pescara, Italy
G. Ferrari: Center for Mathematical Economics (IMW), Bielefeld University, Universitätsstrasse 25, 33615 Bielefeld, Germany
Abstract.
We consider a government that aims at reducing the debt-to-gross domestic product (GDP) ratio of a country. The government observes the level of the debt-to-GDP ratio and an indicator of the state of the economy, but does not directly observe the development of the underlying macroeconomic conditions. The government’s criterion is to minimize the sum of the total expected costs of holding debt and of debt’s reduction policies. We model this problem as a singular stochastic control problem under partial observation. The contribution of the paper is twofold. Firstly, we provide a general formulation of the model in which the level of debt-to-GDP ratio and the value of the macroeconomic indicator evolve as a diffusion and a jump-diffusion, respectively, with coefficients depending on the regimes of the economy. These are described through a finite-state continuous-time Markov chain. We reduce via filtering techniques the original problem to an equivalent one with full information (the so-called separated problem), and we provide a general verification result in terms of a related optimal stopping problem under full information. Secondly, we specialize to a case study in which the economy faces only two regimes, and the macroeconomic indicator has a suitable diffusive dynamics. In this setting we provide the optimal debt reduction policy. This is given in terms of the continuous free boundary arising in an auxiliary fully two-dimensional optimal stopping problem.
Keywords: singular stochastic control; partial observation; filtering; separated problem; optimal stopping; free boundary; debt-to-GDP ratio.
MSC2010 subject classification: 93E20, 60G35, 93E11, 60G40, 60J60, 91B64.
1. Introduction
The question of optimally managing debt-to-GDP ratio (also called “debt ratio”) of a country has become particularly important in the latest years. Indeed, concurrently with the financial crisis started in 2007, debt-to-GDP ratio exploded from an average of 53% to circa 80% in developed countries. Clearly, the debt management policy of a government highly depends on the underlying macroeconomic conditions; indeed, these affect, for example, the growth rate of GDP which, in turn, determines the growth rate of the debt-to-GDP ratio of a country. However, in practice it is typically neither possible to measure in real-time the growth rate of GDP, nor one can directly observe the underlying business cycles. On August 24, 2018, Jerome H. Powell, Chairman of the Federal Reserve, said:111Speech at “Changing Market Structure and Implications for Monetary Policy”, a symposium sponsored by the Federal Reserve Bank of Kansas City in Jackson Hole, Wyoming.
…In conventional models of the economy, major economic quantities such as inflation, unemployment, and the growth rate of gross domestic product fluctuate around values that are considered “normal”, or “natural” or “desired”. The FMOC (Federal Open Market Committee) has chosen a 2 percent inflation objective as one of these desired values. The other values are not directly observed, nor can they be chosen by anyone…
Following an idea that dates back to [28], in this paper we suppose that the GDP growth rate of a country is modulated by a continuous-time Markov chain that is not directly observable. The Markov chain has states modeling the different business cycles of the economy, so that a shift in the macroeconomic conditions induces a change in the value of the growth rate of GDP. The government can observe only the current levels of the debt-to-GDP ratio and of a macroeconomic indicator. The latter might be, e.g., one of the so-called “Big Four”222These indicators constitute the Conference Board’s Index of Coincident Indicators; they are employment in non agricultural businesses, industrial production, real personal income less transfers, and real manufacturing and trade sales. We refer to, e.g., [44], where the authors present a wide range of economic indicators and examine the forecasting performance of various of them in the recession of 2001., which are usually considered proxies of the industrial production index, hence of the business conditions.
The government may intervene in order to decrease the level of the debt ratio, e.g. through fiscal policies or imposing austerity policies in the form of spending cuts. We assume that the debt ratio is instantaneously affected by any such policy. Debt reductions must not necessarily be performed at rates, but also lump sum actions are allowed, and the cumulative amount of debt ratio’s decrease is the government’s control variable. Any decrease of the debt ratio results in proportional costs, and the government aims at choosing a debt-reduction policy that minimizes the total expected costs of holding debt, plus the total expected costs of interventions on the debt ratio. In line with recent papers on stochastic control methods for optimal debt management (see [5], [6], [23] and [24]), we model the previous problem as a singular stochastic control problem. However, differently to all the previous works, our problem is formulated in a partial observation setting, thus leading to a completely different mathematical analysis. In our model, the observations consist of the debt ratio and of the macroeconomic indicator. The debt ratio is a linearly controlled geometric Brownian motion, and its drift is given in terms of the GDP growth rate, which is modulated by the unobservable continuous-time Markov chain . The macroeconomic indicator is a real-valued jump-diffusion which is correlated to the debt ratio process, and which has drift, and both intensity and jump sizes, depending on .
Our Contributions. Our study of the optimal debt reduction problem is performed thought three main steps.
First of all, via advanced filtering techniques with mixed-type observations, we reduce the original problem to an equivalent problem under full information, the so-called separated problem. The filtering problem consists in characterizing the conditional distribution of the unobservable Markov chain , at any time , given observations up to time . The case of diffusion observations has been widely studied in literature and textbook treatments can be found in [20], [32], and [37]. There are also known results for pure-jump observations (see, e.g., [4], [8], [9], [35], and references therein). More recently, filtering problems with mixed-type information, which involve pure-jump processes and diffusions, have been studied in [11], [12], and [25]. Due to the structure of our observations’ dynamics we cannot apply the probability reference method (see [12] and [46]), and for this reason we choose an alternative route based on the innovation approach, which leads to the Kushner-Stratonovich equation. Moreover, differently to [11] and [25], in our framework the innovation process is two-dimensional and, therefore, the innovation method employed in these papers must be suitably adapted to our context. By showing that the Kushner-Stratonovich equation admits a unique strong solution, we are then able to prove that the original problem under partial observation and the separated problem are equivalent in the sense that they share the same value and the same optimal control.
Secondly, we exploit the convex structure of the separated problem, and we provide a general probabilistic verification theorem. This result - which is in line with findings in [2], [15] and [23], among others - relates the optimal control process to the solution to an auxiliary optimal stopping problem. Moreover, it proves that the value function of the separated problem is the integral - with respect to the controlled state variable - of the value function of the optimal stopping problem. The stopping problem thus gives the optimal timing at which one additional unit of debt should be reduced.
Finally, by specifying a setting in which the continuous-time Markov chain faces only two regimes (a fast growth or slow growth phase) and the macroeconomic indicator is a suitable diffusion process, we are able to characterize the optimal debt reduction policy. In this framework, the filter process is a two-dimensional process , where is the conditional probability at time that the economy enjoys the fast growth phase. We prove that the optimal control prescribes to keep at any time the debt ratio below an endogenously determined curve that is a function of government’s belief about the current state of the economy. Such a debt ceiling is the free boundary of the fully two-dimensional optimal stopping problem that is related to the separated problem in the sense of the previously discussed verification theorem. By using almost exclusively probabilistic means, we are able to show that the value function of the auxiliary optimal stopping problem is a -function of its arguments, and thus enjoys the so-called smooth-fit property. Moreover, the free boundary is a continuous, bounded, and increasing function of the filter process. This last monotonicity property has also a clear economic interpretation: the more the government believes that the economy enjoys a regime of fast growth, the less strict the optimal debt reduction policy should be.
As a remarkable byproduct of the regularity of the value function of the optimal stopping problem, we also obtain that the value function of the singular stochastic control problem is a classical solution to its associated Hamilton-Jacobi-Bellman (HJB) equation. The latter takes the form of a variational inequality involving an elliptic second-order partial differential equation (PDE). It is worth noticing that the regularity of the value function implies the validity of a second-order principle of smooth fit, usually observed in one-dimensional problems.
We believe that the study of the auxiliary fully two-dimensional optimal stopping problem is a valuable contribution to the literature on its own. Indeed, if the literature on one-dimensional optimal stopping problems is very rich, the problem of characterizing the optimal stopping rule in multi-dimensional settings has been so far rarely explored in the literature (see the recent [13], [15] and [31], among the very few papers dealing with multi-dimensional stopping problems). This discrepancy is due to the fact that a standard guess-and-verify approach, based on the construction of an explicit solution to the variational inequality arising in the considered optimal stopping problem, is not anymore applicable in multi-dimensional settings where the variational inequality involves a PDE rather than an ordinary differential equation.
Related Literature. As already noticed above, our paper is placed among those recent works addressing the problem of optimal debt management via continuous-time stochastic control techniques. In particular, [5] and [6] model an optimal debt reduction problem as a one-dimensional control problem with singular and bounded-velocity controls, respectively. In [24] the government is allowed to increase and decrease the current level of debt ratio, and the interest rate on debt is modulated by a continuous-time observable Markov chain. The mathematical formulation leads to a one-dimensional bounded-variation stochastic control problem with regime switching. In [23], when optimally reducing the debt ratio, the government takes into consideration the evolution of the inflation rate of the country. The latter evolves as an uncontrolled diffusion process and affects the growth rate of the debt ratio, which is a process of bounded variation. In this setting, the debt reduction problem is formulated as a two-dimensional singular stochastic control problem whose HJB equation involves a second-order linear parabolic partial differential equation. All the previous papers are formulated in a full information setting, while ours is under partial observation.
The literature on singular stochastic control problems under partial observation is also still quite limited. Theoretical results on the PDE characterization of the value function of a two-dimensional optimal correction problem under partial observation are obtained in [38], whereas a general maximum principle for a not necessarily Markovian singular stochastic control problem under partial information has been more recently derived in [39]. We also refer to [14] and [19], where it is provided a thorough study of the optimal dividend strategy in models in which the surplus process evolves as drifted Brownian motion with unknown drift that can take only two constant values, with given probability.
Outline of the Paper. The rest of the paper is organized as follows. In Section 2 we introduce the setting and formulate the problem. The reduction of the problem under partial observation to the separated problem is performed in Section 3; in particular, the filtering results are presented in Section 3.1. The probabilistic verification theorem connecting the separated problem to one of optimal stopping is then proved in Section 3.3. In Section 4 we then consider a case study in which the economy faces only two regimes. Its solution, presented in Sections 4.2 and 4.3, hinges on the study of a two-dimensional optimal stopping problem that is performed in Section 4.1. Finally, Appendix A collects the proofs of some technical filtering results.
2. Setting and Problem Formulation
2.1. The Setting
Consider the complete filtered probability space , capturing all the uncertainty of our setting. Here, denotes the full information filtration. We suppose that such a filtration satisfies the usual hypotheses of completeness and right-continuity.
We denote by a continuous-time finite-state Markov chain describing the different states of the economy. For , let be the state space of and its generator matrix. Here, , , gives the intensity of a transition from state to state , and it is such that , for , and . For any time , is -measurable.
In absence of any intervention by the government, we assume that the (uncontrolled) debt-to-GDP ratio evolves as
[TABLE]
where is a standard -Brownian motion on independent of , and are constants, and . The constant is the real interest rate on debt, is the debt’s volatility, and is the rate of the GDP’s growth when the economy is in state .
It is clear that equation (2.1) admits a unique strong solution, and, when needed, for any we shall denote it by . The current level of the debt-to-GDP ratio is known to the government at any time , and is therefore the first component of the so-called observation process.
The government also observes a macroeconomic stochastic indicator , e.g. one of the so-called “Big Four”, which we interpret as a proxy of the business conditions. We assume that is a jump-diffusion process solving the stochastic differential equation
[TABLE]
where , , , and are measurable functions of their arguments, and is the state space of . Here, is an -standard Brownian motion, independent of and . Moreover, is an -adapted point process, without common jump times with , independent of and . The predictable intensity of is denoted by and depends on the current state of the economy, with being a measurable function. From now on, we assume the following assumptions that ensure strong existence and uniqueness of the solution to equation (2.2) (see [26], among others).
Assumption 2.1**.**
The functions , , , and are such that for any :
- (i)
(Continuity) , , and are continuous;
- (ii)
(Local Lipschitz conditions) for any , there exists a constant such that if , , then
[TABLE]
- (iii)
(Growth conditions) there exists a constant such that
[TABLE]
The dynamics proposed in equation (2.2) is of a jump-diffusive type, and it allows for a size and intensity of the jumps affected by the state of the economy. It is therefore flexible enough to describe a large class of stochastic factors which may exhibit jumps.
The observation filtration is defined as
[TABLE]
where and denote the natural filtrations generated by and , respectively, as usual augmented by -null sets. Clearly, is adapted to both and , and
[TABLE]
The above inclusion means that the government cannot directly observe the state of the economy , but this has to be inferred through the observation of . We are therefore working in a partial information setting.
2.2. The Optimal Debt Reduction Problem
The government can reduce the level of the debt-to-GDP ratio by intervening on the primary budget balance (i.e. the overall difference between government revenues and spending), for example through austerity policies in the form of spending cuts. By doing so the debt ratio dynamics modifies as
[TABLE]
The process is the control that the government chooses based on the information at its disposal. Precisely, defines the cumulative reduction of the debt-to-GDP ratio made by the government up to time , and is therefore a nondecreasing process belonging to the set
[TABLE]
for any given and fixed initial value of , initial value of , and . Here
[TABLE]
is the probability simplex on , representing the space of initial distributions of the process . From now on, we set a.s. for any .
Remark 2.2**.**
Notice that in the definition of the set above, as well as in (2.6) and in (P1) below, we have stressed the dependency on the initial data just for notational convenience, and not to stress any Markovian nature of the considered problem, which is in fact not such.
For any and , there exists a unique solution to (2.4), denoted by , that is given by
[TABLE]
where
[TABLE]
Here, and in the rest of this paper, we shall use the notation for the Lebesgue-Stieltjes integral with respect to the random measure induced by the nondecreasing process on .
Remark 2.3**.**
The dynamics (2.4) might be justified in the following way. Suppose that the public debt (in real terms), , and the GDP, , follow the classical dynamics
[TABLE]
where is the cumulative real budget balance up to time . An easy application of Itô’s formula then gives that the ratio evolves as in (2.4), upon setting .
The government aims at reducing the level of the debt ratio. Having a level of debt ratio at time when the state of the economy is , the government incurs an instantaneous cost . This cost may be interpreted as a measure of the resulting losses for the country due to the debt, as, e.g., a tendency to suffer low subsequent growth (see [22] and [45], among others). The cost function fulfills the following requirements (see also [5] and [23])
Assumption 2.4**.**
- (i)
For any , the mapping is strictly convex, continuously differentiable, and it is nondecreasing on . Moreover, ;
- (ii)
For any given and one has
[TABLE]
A quadratic cost function of the form , , , clearly satisfies Assumption 2.4 for a suitable .
Whenever the government intervenes in order to reduce the debt-to-GDP ratio, it incurs a proportional cost. We assume that the marginal cost of each intervention is normalized to one.
Given an intertemporal discount rate , for any given and fixed , the government thus aims at minimizing the expected total cost functional
[TABLE]
Here is the expectation under the condition that , has initial distribution , and . The government’s problem under partial observation can be therefore defined as
[TABLE]
One has that is well defined and finite. Indeed, it is nonnegative, due to the nonnegativity of ; moreover, since the admissible policy “instantaneously reduce at initial time the debt ratio to [math]” is a priori suboptimal and it has cost , then .
We would like to stress once more that any is -adapted, and therefore Problem (P1) is a stochastic optimal control problem under partial observation. In particular, it is a singular stochastic control problem under partial observation; that is, an optimal control problem in which the random measures induced by the nondecreasing control processes on might be singular with respect to the Lebesgue measure, and in which one component of the state variable, , is not directly observable by the controller.
In its current formulation, the optimal debt reduction problem is not Markovian and it is therefore not directly solvable by standard means of stochastic control theory. In the next section, by using techniques from filtering theory, we will introduce an equivalent problem under complete information, the so-called separated problem. This will enjoy a Markovian structure and its solution will be characterized in Section 3.3 through a Markovian optimal stopping problem.
3. Reduction to an Equivalent Problem under Complete Information
In this section we derive the separated problem. To this end, we first study the filtering problem arising in our model. As already discussed in the introduction, results on such a filtering problem cannot be directly obtained from existing literature due to the structure of our dynamics.
3.1. The Filtering Problem
The filtering problem consists in finding the best-mean squared estimate of , for any and any measurable function , on the basis of the information collected up to time . In our setting, such an information flow is given by the filtration . That estimate can be described through the filter process , which provides the conditional distribution of given , for any time (see, for instance, [37]). It is defined as the -càdlàg (right-continuous with left limits) process taking values in the space of probability measures on such that
[TABLE]
for all measurable functions on . Since takes only a finite number of values, the filter is completely described by the vector
[TABLE]
where , . With a slight abuse of notation, in the following we will denote by the process , so that for all measurable functions we have from (3.1),
Setting and, accordingly, , , notice that is clearly a bounded function. Then, define the two processes and such that for any
[TABLE]
where
[TABLE]
Henceforth, we will work under the following Novikov’s condition.
Assumption 3.1**.**
[TABLE]
Under Assumption 3.1, by classical results from filtering theory (see, e.g., [37]), the innovation processes and are Brownian motions with respect to the filtration . Moreover, given the assumed independence of and , they turn out to be independent.
The integer-valued random measure associated to the jumps of is defined as
[TABLE]
where denotes the Dirac measure at point . Notice that the -adapted random measure is such that
[TABLE]
To proceed further we need the following useful definitions.
Definition 3.2**.**
(-Predictable Process indexed by ). Given any filtration , let denote the predictable -field on and the Borel -algebra on . Any mapping which is -measurable is called -predictable process indexed by .
Letting
[TABLE]
we denote by the filtration generated by the random measure .
Definition 3.3**.**
(Dual Predictable Projection of ). Given any filtration , such that , the -dual predictable projection of , denoted by , is defined as the unique positive -predictable random measure such that for any nonnegative, -predictable process indexed by
[TABLE]
To prove that a positive -predictable random measure provides the -dual predictable projection of it suffices to prove that equation (3.9) holds true for any process of the form with a nonnegative -predictable process and . For further details we refer to [4] and [29].
We now aim at deriving an equation for the evolution of the filter (the filtering equation). To this end we use the so-called innovation approach (see [4], [11], and [37], among others), which, in our setting, requires the introduction of the -compensated jump measure of
[TABLE]
where is the -dual predictable projection of (cf. Definition 3.3 above). The triplet also represents the building block of the construction of -martingales, as it is shown in Proposition 3.5 below. We start determining the form of .
Proposition 3.4**.**
The -dual predictable projection of is given by
[TABLE]
where denotes the Dirac measure at point .
Proof.
Step 1. First, we prove that the -dual predictable projection of is given by
[TABLE]
Let and introduce
[TABLE]
is the point process counting the number of jumps of up to time with jumps’ size in the set . Since by (2.2) one has that , , and is a point process with -predictable intensity given by , we obtain that for each nonnegative -predictable process
[TABLE]
That is, for any , we have that provides the -predictable intensity of the counting process . Recalling (3.13) and Definition 3.3, this implies that given in (3.12) coincides with the -dual predictable projection of , since equation (3.9) holds with the choice and , with an arbitrary nonnegative -predictable process and .
Step 2. As in Proposition 2.3 in [10] we can now derive the -dual predictable projection of , denoted by , by simply projecting with respect to the observation flow . Precisely, we have that the -predictable intensity of the point process , , is given by
[TABLE]
This implies that is given by (3.11), since (3.9) holds with the choice , , with and an arbitrary nonnegative -predictable process. ∎
An essential tool to prove that the original problem under partial information is equivalent to the separated one is the characterization of the filter as the unique solution to the filtering equation (see [7] and [21]). In order to derive the filtering equation solved by we first give a representation theorem for -martingales. The proof of the following technical result is given in Appendix A.
Proposition 3.5**.**
Under Assumptions 2.1 and 3.1, every -local martingale admits the decomposition
[TABLE]
where and are -adapted processes, and is an -predictable process indexed by such that a.s.
[TABLE]
We are now in the position to prove the following fundamental result, whose proof is postponed to Appendix A.
Theorem 3.6**.**
Recall (3.10), let be the initial distribution of , and let Assumptions 2.1 and 3.1 hold. Then the filter solves the Kushner-Stratonovich system
[TABLE]
for any . Here, and
[TABLE]
denotes the Radon-Nikodym derivative of the measure with respect to .
Let us introduce the sequence of jump times and jump sizes of the process , denoted by , and recursively defined as ,
[TABLE]
In the definitions above we use the standard convention that .
Then the integer-valued measure associated to the jumps of (cf. (3.6)) can also be written as
[TABLE]
The filtering system of equations (3.14) has a natural recursive structure in terms of the sequence , as it is shown in the next proposition.
Proposition 3.7**.**
Between two consecutive jump times, , the filtering system of equations (3.14) reads as
[TABLE]
for any . At a jump time of , say , jumps as well, and its value is given by
[TABLE]
Proof.
First, recalling that and that
[TABLE]
we obtain that
[TABLE]
which, from (3.14), implies that for any , solves equation (3.7).
Finally, equation (3.18) follows by (3.15) and
[TABLE]
∎
We want to stress that equation (3.18) shows that the vector is completely determined by the observed data and by the knowledge of for , since , .
Remark 3.8**.**
A few comments on the filtering equation are worth being done.
- (1)
In the case , for any and , the sequences of jump times of and coincide, and the filtering system of equations (3.14) reduces to the simpler
[TABLE] 2. (2)
In the case and , for any and , the filtering system of equations (3.14) does not depend anymore explicitly on the process . In particular, one has
[TABLE]
where we have set \alpha_{i}:=\sigma_{2}^{-1}\big{\{}b_{1}(i)-\sigma^{-1}\beta_{i}\sigma_{1}\big{\}}. With reference to (2.2) and (3.4), this setting corresponds, e.g., to the purely diffusive arithmetic case , and , , for any and , or to the purely diffusive geometric case , and , , for any and . In Section 4 we will provide the explicit solution to the optimal debt reduction problem within this setting.
3.2. The Separated Problem
Thanks to the introduction of the filter, equations (2.1), (2.2), and (2.4) can now be rewritten in terms of observable processes. In particular, we have that
[TABLE]
[TABLE]
and
[TABLE]
Notice that, for any , the process turns out to be -adapted, and depends on the vector , such that .
Definition 3.9**.**
(Strong Uniqueness). We say that a process with values in is a strong solution to equations (3.14) and (3.21) if it satisfies pathwise those equations. We say that strong uniqueness for the system of equations (3.14) and (3.21) holds if, for any strong solution to system (3.14) and (3.21), one has , , a.s. for all .
Proposition 3.10**.**
Let Assumptions 2.1 and 3.1 hold, and suppose that is locally-Lipschitz for any , and there exists such that , for any and any . Then system (3.14) and (3.21) admits a unique strong solution.
Notice that, under Assumption 2.1, the requirement on of Proposition 3.10 is verified, e.g., whenever , for some and for any , or if and are locally-Lipschitz on and have sublinear growth. The proof of Proposition 3.10 is postponed to Appendix A. As a byproduct, it also ensures strong uniqueness of the solution to (3.22). In the following, when there will be the need to stress the dependence with respect to the initial value , we shall denote the solution to (3.20) and (3.22) by and , respectively.
Since
[TABLE]
an application of Fubini-Tonelli’s theorem allows to rewrite also the cost functional of (2.6) in terms of observable quantities as
[TABLE]
Here denotes the expectation conditioned on , , and . Notice that the latter expression does not depend anymore on the unobservable process , and this allows us to introduce a control problem with complete information, the separated problem, in which the new state variable is given by the triplet . For this problem we rewrite the set in terms of the observable processes given by (3.14), (3.21) and (3.22), and we denote by such a representation of the set ; that is,
[TABLE]
for every initial value of defined in (3.22), for any initial values of the process solution to equation (3.14), and for any initial value of . In the following, we set a.s. for any .
Given , the triplet solves (3.22), (3.14) and (3.21) and the jump measure associated to has -predictable dual projection given by equation (3.11). Hence, the process is an -Markov process and we therefore define the Markovian separated problem as
[TABLE]
This is now a singular stochastic problem under complete information, since all the processes involved are -adapted.
The next proposition immediately follows from the previous construction of the separated problem, and from the strong uniqueness of the solutions to (3.14), (3.21), and (3.22).
Proposition 3.11**.**
Assume strong uniqueness for the system of equations (3.14) and (3.21), and let be the initial values of the process in the problem under partial observation (P1). Then
[TABLE]
Moreover, is an optimal control for the separated problem (P2) if and only if is an optimal control for the original problem under partial observation (P1).
Remark 3.12**.**
Notice that in the setting of Remark 3.8-(2), the pair solving equations (3.22) and (3.14), respectively, is an -Markov process, for any , . As a consequence, since the cost functional and the set of admissible controls do not depend explicitly on the process , the value function of the separated problem (P2) does not depend anymore on the variable . We will consider this setting as a case study in Section 4.
3.3. A Probabilistic Verification Theorem via Reduction to Optimal Stopping
In this section we relate the separated problem to a Markovian optimal stopping problem, and we show that the solution to the latter is directly related to the optimal control of the former. The following analysis is fully probabilistic and it is based on a change of variable formula for Lebesgue-Stieltjes integrals that has been already employed in singular control problems (see, e.g., [2] and [23]). The result of this section will then be employed in Section 4 where, in a case study, we determine the expression of the optimal debt reduction policy by solving an auxiliary optimal stopping problem.
With regard to Problem (P2), notice that \pi_{t}\big{(}h(X_{t}^{x,\nu},\cdot)\big{)}=\sum_{i=1}^{Q}\pi_{t}(i)h(X_{t}^{x,\nu},i) a.s. for any . Then, for any , set
[TABLE]
and, given , we introduce the optimal stopping problem
[TABLE]
where the optimization is taken over all the -stopping times .
Under Assumption 2.4, the expectation in (3.26) is finite for any -stopping time , for any . To take care of the event , in (3.26) we make use of the convention
[TABLE]
Denote by a càdlàg modification of , and observe that , for any , a.s. Also, define the stopping time
[TABLE]
with the convention that if the set on the right-hand side is empty. Then by Theorem D.12 in Appendix D of [34], is an optimal stopping time for problem (3.26). In particular, is optimal for the problem
[TABLE]
Notice that since is a.s. increasing, then is a.s. decreasing. Such monotonicity of will be important in the following as we will need to consider its generalized inverse. Moreover, since the triplet is an homogenous -Markov process, there exists a measurable function such that for any , a.s. Hence, , and for any , define
[TABLE]
Moreover, introduce the nondecreasing, right-continuous process
[TABLE]
and then also the process
[TABLE]
Notice that is the right-continuous inverse of .
Theorem 3.13**.**
Let be as in (3.30) and as in the definition of Problem (P2). Then one has , and is the (unique) optimal control for Problem (P2).
Proof.
Step 1. Let , , and be given and fixed. For , we introduce the process such that , , and define its inverse (see, e.g., Chapter 0, Section 4 of [43]) by
[TABLE]
Notice that the process has decreasing, left-continuous sample paths, and hence it admits right-limits
[TABLE]
Moreover, the set of points at which is a.s. countable for a.e. .
The random time is actually an -stopping time because it is the entry time of an open set of the right-continuous process , and is right-continuous. Moreover, since is the first entry time of the right-continuous process into a closed set, it is an -stopping time as well for any .
Proceeding then as in Step 1 of the proof of Theorem 3.1 in [23], by employing the change of variable formula in Chapter 0, Proposition 4.9 of [43], one finds that
[TABLE]
Hence, since was arbitrary, we find
[TABLE]
Step 2. To complete the proof we have to show the reverse inequality. Let , , and , initial values of , and . We first notice that . Indeed, is nondecreasing, right-continuous and such that a.s. for all , since one has by definition a.s. Moreover, for any , we can write (cf. (3.31) and (3.34))
[TABLE]
Then, recalling that -a.s. and for a.e. , we pick (equivalently, ), and following Step 2 in the proof of Theorem 3.1 of [23], we obtain . That is, by (3.35) and admissibility of . Therefore is optimal. In fact, is the unique optimal control in the class of controls belonging to and such that by strict convexity of . ∎
Remark 3.14**.**
For any given , define the Markovian optimal stopping problem
[TABLE]
where denotes the expectation under the probability measure such that . Then, it is readily verified that . Moreover, it holds that the stopping time
[TABLE]
is optimal for .
4. The Solution in a Case Study with Economic Regimes
In this section, we build on the general filtering analysis developed in the previous sections and on the result of Theorem 3.13, and we provide the form of the optimal debt reduction policy in a case study that is defined through the following standing assumption.
Assumption 4.1**.**
- (1)
* takes values in , and, with reference to (2.4), we take ;* 2. (2)
for any and any one has and, for as in (3.4), we take ; 3. (3)
* for all , with such that:*
- (i)
* is strictly convex, twice-continuously differentiable, and nondecreasing on with and ;*
- (ii)
there exist , and such that
[TABLE]
and
[TABLE]
Notice that under Assumption 4.1-(2) the macroeconomic indicator has a suitable diffusive dynamics whose coefficients are such that the function is independent of . As discussed in Remark 3.8-, this is the case of a geometric or arithmetic diffusive dynamics for . In this setting the Kushner-Stratonovich system (3.14) reduces to
[TABLE]
and . Here, and .
Denoting by , , problem (P2) then reads as
[TABLE]
where , denotes the rate of economic growth in the state , .
It is worth noticing that there is no need to involve the process in the Markovian formulation of problem (P3). This is due to the fact that the couple , solving the two stochastic differential equations above is a strong Markov process, and the cost functional and the set of admissible controls (denoted by above) do not depend explicitly on . For this reason the value function of Problem (P3) does not depend on the initial value of the process . However, memory of the macroeconomic indicator process appears in the filter through the constant term in its dynamics.
Finally, we recall that, thanks to Propostion 3.11, by solving Problem (P3) we are also solving the original problem (P1). Indeed, we have that
[TABLE]
and a control is optimal for the separated problem (P3) if and only if it is such for the original problem under partial observation.
In the following analysis, we need (for technical reasons due to the infinite time-horizon of our problem) to take a discount factor sufficiently large. Namely, defining
[TABLE]
with \theta^{2}:=\frac{1}{2}\big{[}\frac{(g_{1}-g_{2})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}, we assume the following.
Assumption 4.2**.**
One has .
Due to the growth condition on , Assumption 4.2 in particular ensures that so that the (trivial) admissible control has a finite total expected cost.
4.1. The Related Optimal Stopping Problem
Motivated by the results of the previous sections (see in particular Theorem 3.13), we now aim at solving Problem (P3) through the study of an auxiliary optimal stopping problem. Informally, the solution to such an optimal stopping problem gives the optimal time at which the government should reduce the debt ratio by one additional unit. The optimal stopping problem involves a two-dimensional diffusive process, and in the following we provide an almost exclusively probabilistic analysis.
4.1.1. Formulation and Preliminary Results
Recall that is a two-dimensional, standard -Brownian motion, and introduce the two-dimensional diffusion process solving the stochastic differential equations (SDEs)
[TABLE]
with initial conditions , for any . In the following, we set . Recall that .
Since the process is bounded, classical results on SDEs ensure that system (4.2) admits a unique strong solution, that, when needed, we shall denote by in order to stress its dependence on the initial datum . In particular, one easily obtains
[TABLE]
Moreover, it can be shown that the Feller’s test of explosion (see, e.g., Chapter 5.5 in [33]) gives that for all . In fact, the boundary points [math] and are entrance-not-exit (cf. [3], p. 15), hence unattainable for the process .
With regard to Remark 3.14, here we study the fully two-dimensional Markovian optimal stopping problem with value function
[TABLE]
In (4.4) the optimization is taken over all the -stopping times, and the symbol denotes the expectation under the probability measure on , defined as , for any .
Due to the fact that is positive, , and by Assumption 4.2, one has from (4.3) that
[TABLE]
which implies the convention (cf. (3.27)) on .
Clearly, one has since is positive and is increasing on . Also, on , and we can therefore define the continuation region and the stopping region as
[TABLE]
Notice that integrating by parts the term , taking expectations, and exploiting that for any -stopping time one has (because by Assumption 4.2), we can equivalently rewrite (4.4) as
[TABLE]
for any . From (4.7) it is readily seen that
[TABLE]
which implies
[TABLE]
Moreover, since satisfies Assumption 4.2, and for any , one has that
[TABLE]
and the family of random variables
[TABLE]
is therefore -uniformly integrable under .
Preliminary properties of are given in the next proposition.
Proposition 4.3**.**
The following hold:
- (i)
* is increasing for any ;*
- (ii)
* is decreasing for any ;*
- (iii)
* is continuous in .*
Proof.
We prove each claim separately.
(i). Recall (4.4). By the strict convexity and the monotonicity of and (4.3), it follows that is increasing for any -stopping time , and for any . Hence the claim is proved.
(ii). This is due to the fact that is decreasing for any stopping time and any . Indeed, the mapping is a.s. decreasing for any (because is a.s. increasing by the comparison theorem of Yamada and Watanabe - see, e.g., Proposition 2.18 in Chapter 5.2 of [33] - and ), and is increasing.
(iii). Since is a.s. continuous for any , it is not hard to verify that is continuous for any given . Hence, is upper semicontinuous. We now show that it is also lower semicontinuous.
Let and let be any sequence converging to . Without loss of generality, we may take , for a suitable . Letting be an -optimal for , but suboptimal for , we can then write
[TABLE]
Notice now that a.s.
[TABLE]
where we have used that is increasing, is decreasing, and is positive and increasing. The random variable on the right-hand side of the latter equation is independent of and integrable due to (4.10).
Also, by an integration by parts, and performing standard estimates, we can write that a.s.
[TABLE]
and the last integral above is independent of and it has finite expectation due to (4.10).
Then, taking limits as , invoking the dominated convergence theorem thanks to the previous estimates, and using that is a.s. continuous for any we find (after rearranging terms) that
[TABLE]
We thus conclude that is lower semicontinuous at by arbitrariness of . Since was arbitrary as well, then is lower semicontinuous on . ∎
Due to Proposition 4.3-(iii) one has that the stopping region is closed, whereas the continuation region is open. Moreover, thanks to (4.10) and the -a.s. continuity of , we can apply Theorem D.12 in Appendix D of [34] to obtain that the first entry time of into is optimal for (4.4); that is,
[TABLE]
attains the infimum in (4.4) (here we adopt the usual convention ).
Also, by employing standard means based on the strong Markov property of (see, e.g., [40], Ch. I, Sec. 2, Thm. 2.4), one can show that, -a.s., the process S:=\big{(}S_{t}\big{)}_{t\geq 0}, with
[TABLE]
and that the stopped process (S_{t\wedge\tau^{\star}}\big{)}_{t\geq 0} is an -martingale. The latter two conditions are usually referred to as the subharmonic characterization of the value function .
We now rule out the possibility of an empty stopping region.
Lemma 4.4**.**
The stopping region of (4.6) is not empty.
Proof.
We argue by contradiction and we suppose that . Hence, for any we can write
[TABLE]
where the inequality , due to convexity of , and the growth condition assumed on (cf. Assumption 4.1) have been used. Now, by taking sufficiently large, we reach a contradiction since by assumption. Hence . ∎
Proposition 4.5**.**
For any let
[TABLE]
where the convention has been used. Then
- (i)
[TABLE]
- (ii)
* is increasing and left-continuous;*
- (iii)
there exist such that for any
[TABLE]
Proof.
(i). To show that (4.15) holds true it suffices to show that if , then for any . Let be an -optimal stopping time for . Then, exploiting the fact that a.s. and the monotonicity of , we can write from (4.7)
[TABLE]
Therefore, by arbitrariness of , we conclude that as well, and therefore that as in (4.14) splits and as in (4.15).
(ii). Let . Since is decreasing by Proposition 4.3-(ii), it thus follows that for any . This in turn implies that is increasing. The monotonicity of , together with the fact that is closed, then give the claimed left-continuity by standard arguments.
(iii). Let \Theta^{x}_{t}:=x\exp\big{\{}(\beta_{2}-\frac{1}{2}\sigma^{2}+(g_{2}-g_{1}))t+\sigma I_{t}\big{\}}, and introduce the one-dimensional optimal stopping problem
[TABLE]
Because , is increasing, and a.s. for all and , it is not hard to see that for any .
By arguments similar to those employed to prove (i) above one can show that there exists such that . In fact, by arguing as in the proof of Lemma 4.4, one has that the latter set is not empty. Then the following inclusions hold
[TABLE]
which in turn show that for all . Hence, also for all , by setting by monotonicity, and by left-continuity.
As for the lower bound of , notice that (4.9) implies
[TABLE]
where is the inverse of the strictly increasing function (notice that since , , and ). Since is strictly increasing, and , we can conclude from (4.18) that d(y)\geq(h^{\prime})^{-1}\big{(}\rho-\beta_{2}) for every .
Moreover, setting and introducing the one-dimensional optimal stopping problem
[TABLE]
one has that for any . Following arguments as those employed above, the last inequality implies that for all , where . ∎
4.1.2. Smooth-Fit Property and Continuity of the Free Boundary
We now aim at proving further regularity of and of the free boundary .
The second-order linear elliptic differential operator
[TABLE]
acting on any function , is the infinitesimal generator of the process . The nondegeneracy of the process , the smoothness of the coefficients in (4.1.2), together with the subharmonic characterization of , allow to prove by standard arguments (see, e.g., [40], Ch. 3, Sec. 7.1) and classical regularity results for elliptic partial differential equations (see, e.g., [27]) the following result.
Lemma 4.6**.**
The value function of (4.4) belongs to separately strictly inside and (i.e. away from the boundary of ). Moreover, inside it uniquely solves
[TABLE]
with as in (4.1.2).
We continue our analysis by proving that the value function of (4.4) belongs to . This will be obtained through probabilistic methods that rely on the regularity (in the sense of diffusions) of the stopping set for the process (see [17] where this methodology has been recently developed in a general context; for other examples refer to [16] and [31]). Recall that the boundary points are regular for relative to if (cf. Definition 2.9 p. 249 in [33])
[TABLE]
The time is the first hitting time of to .
Notice that for every bounded Borel function one has \mathsf{E}_{(x,y)}\big{[}f(\widehat{X}_{t},\pi_{t})\big{]}=\mathsf{E}_{(u,y)}\big{[}f(e^{U_{t}},\pi_{t})\big{]}, where and is such that dU_{t}=\big{(}\beta_{2}+(g_{2}-g_{1})\pi_{t}-\frac{1}{2}\sigma^{2}\big{)}dt+\sigma dI_{t}. Due the nondegeneracy of the process , and the smoothness and boundedness of its coefficients, we have that has a continuous transition density , , such that for any and (see, e.g., [1])
[TABLE]
for some constants and . It thus follows that (u,y)\mapsto\mathsf{E}_{(u,y)}\big{[}f(e^{U_{t}},\pi_{t})\big{]} is continuous, so that is a strong Feller process. Hence, is strong Feller as well, and we can therefore conclude that (4.22) holds true if and only if (see [18], pp. 32-40)
[TABLE]
where is as in (4.12).
The next proposition shows the validity of (4.22).
Proposition 4.7**.**
The boundary points in are regular for relative to ; that is, (4.22) holds.
Proof.
Let , and set . With as defined above, we set , , and we equivalently rewrite (4.22) in terms of the process as
[TABLE]
Given that is increasing (since is such), then the region enjoys the so-called cone property (see [33], p. 250). In particular, we can always construct a cone with vertex in and aperture such that , and for any we can write that
[TABLE]
Then using (4.1.2) one has
[TABLE]
where we have used that the change of variable and maps the cone into itself. The number above depends on , but it is independent of . From (4.25) and (4.1.2) we thus have that , and letting we obtain . However, , and by the Blumenthal’s 0-1 Law we obtain , which completes the proof. ∎
Theorem 4.8**.**
One has that .
Proof.
The value function belongs to strictly inside the continuation region due to Lemma 4.6, and it is strictly inside the stopping region where . It thus only remains to prove that is continuously differentiable across . In the following, we will prove that: (i) the function has continuous derivative with respect to across (and this clearly implies the continuity of across ); (ii) that the function is continuous across .
(i) Continuity of across . For the subsequent arguments it is useful to notice that the function admits the representation (recall (4.7))
[TABLE]
and to bear in mind that the optimal stopping time for as in (4.12) is also optimal for since if and only if . We now prove that is continuous across , thus implying continuity of across .
Take , and let be such that . Since is increasing (due to the monotonicity of ) it is clear that as well. Denote by the optimal stopping time for , and notice that is suboptimal for and a.s. To simplify exposition in the following we write and . We can then write from (4.27)
[TABLE]
for some , and where in the last step we have used the mean value theorem, and the fact that . Letting , invoking the dominated convergence theorem (thanks to the fact that \rho>\big{(}\gamma\beta_{2}+\frac{1}{2}\sigma^{2}\gamma(\gamma-1)\big{)}\vee\big{(}2\beta_{2}+\sigma^{2}\big{)} by Assumption 4.2), and using that (since ), we then find from the latter that
[TABLE]
Let now be any arbitrary point belonging to . Taking limits in (4.28) as , by the dominated convergence theorem and thanks to Proposition 4.7 we obtain that
[TABLE]
thus proving that is continuous across . This immediately implies the continuity of across , upon recalling that .
(ii) Continuity of across . Take again , and let be such that . Since is decreasing (cf. Proposition 4.3-(ii)), it is clear that as well. Denote by the optimal stopping time for and notice that is suboptimal for and a.s. as . In order to simplify the notation, in the following we write instead of .
From Proposition 4.3-(ii) and (4.7) we can then write
[TABLE]
Now, add and subtract both and in the right-hand side of the latter, and recall that , that a.s. for every , as well as that a.s. for every . Then, after rearranging terms and employing the integral mean value theorem (for some a.s.), we obtain from the equation above that
[TABLE]
In the last inequality we have used that , since by Assumption 4.2, that , and that .
Define now , , and notice that, by using the second equation in (4.2), we can write
[TABLE]
with . With the help of Itô’s formula, it can be easily shown that
[TABLE]
with \theta^{2}:=\frac{1}{2}\big{[}\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}, solves the previous stochastic differential equation.
Also, by (4.3) and simple algebra,
[TABLE]
Employing the definition of and (4.31) in (4.1.2), and using that , one finds
[TABLE]
We now aim at taking limits as in (4.1.2). To this end, notice that a.s. for all , as , where, by Theorem in Chapter V.7 of [42], is the unique strong solution to
[TABLE]
with . Then, if we were allowed to invoke the dominated convergence theorem when taking limits as in (4.1.2), we would obtain that
[TABLE]
upon recalling that . Therefore, letting be any arbitrary point belonging to , by taking limits in (4.1.2) as , by the dominated convergence theorem and thanks to Proposition 4.7 we obtain that
[TABLE]
thus proving that is continuous across .
In order to complete the proof it thus only remains to show that the dominated convergence theorem can indeed be applied when taking limits as in (4.1.2). This is what we are going to show in the two following technical steps.
Step 1. To prove that the dominated convergence theorem can be invoked when taking in the first expectation on the right-hand side of (4.1.2), we set
[TABLE]
and we show that the family of random variables is bounded in , hence uniformly integrable.
Notice that by Assumption 4.1-(ii) and the fact that a.s., one has a.s. for any
[TABLE]
for some constant (independent of ), so that by Jensen’s inequality
[TABLE]
Then, taking expectations and applying Hölder’s inequality
[TABLE]
for some other constant , independent of , that in the following will be varying from line to line.
The standard inequality , with , allows us to continue from (4.35) and write
[TABLE]
We now treat the two expectations in (4.36) separately. First of all, notice that by Jensen’s inequality
[TABLE]
Second of all, thanks to the nonnegativity of , we can invoke Fubini-Tonelli’s theorem and using also (4.37), obtain
[TABLE]
We now aim at evaluating the expectation in the last integral above.
To accomplish that, notice that by applying Itô’s formula to the process , and using (4.30), we have for any
[TABLE]
with and \theta^{2}=\frac{1}{2}\big{[}\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}. Because a.s. for all , and
[TABLE]
where is an exponential martingale, it is easy to see that
[TABLE]
Using the latter estimate in (4.1.2), together with Assumption 4.2, we deduce that
[TABLE]
As for the second expectation in (4.36), Assumption 4.2 and standard estimates employing (4.3) (together with the fact that ) guarantee that it is finite. Moreover, it is independent of . Combining this with (4.40) we thus find from (4.36) that \sup_{\varepsilon\in(0,1-y)}\mathsf{E}\Big{[}\big{|}\Lambda_{\varepsilon}\big{|}^{2}\Big{]}^{\frac{1}{2}}<\infty, thus implying that the family of random variables is bounded in , hence uniformly integrable.
Step 2. We consider the second expectation on the right-hand side of (4.1.2), and setting
[TABLE]
we aim at proving that the family of random variables is bounded in , hence uniformly integrable.
By Jensen’s inequality first, and Hölder’s inequality then, one finds that
[TABLE]
for some , independent of .
The first expectation on the right-hand side of (4.41) is finite thanks to Assumption 4.2 and standard estimates employing (4.3) (together with the fact that ). Moreover, it is independent of .
As for the second one, by interchanging expectation and time integral by Fubini-Tonelli’s theorem, and using (4.39), we obtain
[TABLE]
due to Assumption 4.2. We therefore conclude that (cf. (4.41)) \sup_{\varepsilon\in(0,1-y)}\mathsf{E}\Big{[}\big{|}\Xi_{\epsilon}|^{2}\Big{]}^{\frac{1}{2}}<\infty, thus completing the proof. ∎
The previous theorem in particular implies the so-called smooth-fit property, a well known optimality principle in optimal stopping theory. Moreover, by standard arguments based on the strong Markov property of (see Chapter III in [40]) it follows from the results collected so far that the couple solves the free-boundary problem
[TABLE]
with .
An important consequence of Theorem 4.8 is the following.
Proposition 4.9**.**
One has that is continuous on .
Proof.
Define the probability measure on such that \frac{d\widehat{\mathsf{P}}}{d\mathsf{P}}\Big{|}_{\mathcal{F}_{t}}=e^{-\frac{1}{2}\sigma^{2}t+\sigma I_{t}}, . Such a measure is equivalent to on , and defining , by Girsanov’s theorem the latter is a standard -Brownian motion under .
By a change of measure (see, e.g., Section 12 in Chapter IV of [40]) it is then not difficult to see that as in (4.7) is such that , where, for any , we have set
[TABLE]
with \widehat{H}(x,y):=\Big{(}\rho-\beta_{2}-(g_{2}-g_{1})y-h^{\prime}(x)\Big{)}. In (4.43) above denotes the expectation conditioned on the fact that -a.s. Since , is the optimal stopping boundary for the problem with value as well.
In order to prove the continuity of , we now aim at applying Theorem 10 in [41] for problem (4.43). Notice that on since is strictly convex. Moreover, recalling , we have on thanks, again, to the strict convexity of . Also, is continuous across the boundary, due to the -property shown in Theorem 4.8 for ; hence, the horizontal smooth-fit property holds.
We can therefore apply Theorem 10 of [41] (upon noticing that in [41] is the horizontal axis and is the vertical one, while, in our paper, is the vertical axis and is the horizontal one), and conclude that cannot have discontinuities of the first kind at any point . Finally, is also continuous at since it is left-continuous by Proposition 4.5-(ii). ∎
4.2. The Optimal Control for Problem (P3)
In this section, we provide the form of the optimal debt reduction policy. It is given in terms of the free boundary studied in the previous section.
For as in (4.14), introduce under the nondecreasing process
[TABLE]
with , and then the process
[TABLE]
Notice that since a.s. for all , and is nondecreasing, it does follows from (4.45) that is admissible. Moreover, is continuous (with the exception of a possible initial jump at initial time), due to the continuity of and to that of , , and .
Theorem 4.10**.**
Let , . Then one has on , and as in (4.45) is optimal for Problem (P3).
Proof.
Recall as in (3.26), and notice that in our Markovian setting one has . By the proof of Theorem 3.13 it suffices to show that the right-continuous inverse of the stopping time (which is optimal for , cf. (4.12)) coincides (up to a null set) with .
Then, recall (3.34) from the proof of Theorem 3.13, fix , take arbitrary, and notice that by (4.12) we have -a.s. the equivalences
[TABLE]
Hence, a.s., and is the right-continuous inverse of . Since is admissible, by arguing as in Step 2 of the proof of Theorem 3.13 the claim follows. ∎
Notice that the equation of in the formulation of Problem (P3), and (4.45), yield
[TABLE]
which, with regard to (4.44), shows that
[TABLE]
Moreover, it is easy to see that we can express of (4.44) as
[TABLE]
The previous equations allow us to make some remarks about the optimal debt management policy of our problem.
- (i)
If at initial time the level of the debt ratio is above , then an immediate lump sum reduction of amplitude is optimal.
- (ii)
At any , it is optimal to keep the debt ratio level below the belief-dependent ceiling .
- (iii)
If the level of the debt ratio at time is strictly below , there is no need for interventions. The government should intervene to reduce its debt only at those (random) times at which the debt ratio attempts to rise above . These interventions are then minimal, in the sense that solves a Skorokhod reflection problem at the free boundary .
- (iv)
Recall that the debt ceiling is an increasing function of the government’s belief that the economy is enjoying a phase of fast growth. Then, with regard to the previous description of the optimal debt reduction rule, we have that the more the government believes that the economy is in a good shape, the more the fiscal space is, and the less strict the optimal debt reduction policy should be.
4.3. Regularity of the Value Function of Problem (P3) and Related HJB Equation
Combining the results collected so far, we are now able to prove that the value function of control Problem (P3) is a twice-continuously differentiable function. As a byproduct, is a classical solution to the corresponding Hamilton-Jacobi-Bellman (HJB) equation.
By Theorem 4.10 we know that , for all . Hence, thanks to Theorem 4.8 and to the dominated convergence theorem, we immediately obtain the following result.
Lemma 4.11**.**
One has that . Moreover, , as well as .
To take care of the second derivative we follow ideas used in [14]. In particular, we determine the second weak derivative of (recall that is continuous by Theorem 4.8), and we then show that it is a continuous function. This is accomplished in the next proposition.
Proposition 4.12**.**
Let . We have with
[TABLE]
Proof.
Notice that , and therefore is a continuous function for all by Theorem 4.8 (notice indeed that by the bounds in (4.1.2) and the multiplicative dependence of with respect to one has that is integrable at zero). Hence, its weak derivative with respect to is a function such that for any test function one has
[TABLE]
We now aim at evaluating and at showing that it coincides with the right-hand side of (4.12).
Denote by , , the generalized right-continuous inverse of , ; that is, . Then, noticing that on and using Fubini’s theorem, we can write
[TABLE]
where we have used that for all , , as well as .
By Lemma 4.6 (cf. also (4.1.2)), for any , for any with , we have that
[TABLE]
Inserting the latter expression in the last integral term on the right-hand side of (4.3), using again Fubini’s theorem and then integrating the derivatives with respect to , we find
[TABLE]
where we have also used that . Finally, setting
[TABLE]
we see that (4.3) reads , so that identifies with the second weak derivative of with respect to . Notice that is continuous by the continuity of , , , , and the fact that and are finite due to (4.3), (4.4), and (4.1.2). The proof is therefore completed. ∎
Thanks to Lemma 4.11 and Proposition 4.12 we have that . As a byproduct of this, by the Dynamic Programming Principle and standard means based on an application of Dynkin’s formula, we obtain the next result.
Proposition 4.13**.**
Recall the second-order differential operator defined in (4.1.2). The value function of Problem (P3) is a classical solution to the HJB equation
[TABLE]
with boundary condition for any .
Acknowledgments
The research of Claudia Ceci is partially supported by “Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni” (GNAMPA) of “Istituto Nazionale di Alta Matematica” (INdAM). Financial support by the German Research Foundation (DFG) through the Collaborative Research Centre 1283 is gratefully acknowledged by Giorgio Ferrari. We wish to thank Luciano Campi, Tiziano De Angelis, Paola Mannucci, Fabio Paronetto, Paavo Salminen, and Wolfgang Runggaldier for useful discussions.
Appendix A Filtering Results
Proof of Proposition 3.5.
Since the innovation processes (see (3.3)) and the random measure (see (3.6) and (3.8)) are -adapted, then . In general, the latter inclusion could be strict. Let us now consider the exponential -martingale solving
[TABLE]
and define a probability measure on , equivalent to on , and such that
[TABLE]
Notice that Assumption (3.1) ensures that is indeed an -martingale. By Girsanov’s theorem, the processes
[TABLE]
are -independent Brownian motions. We now prove that On the one hand, the inclusion follows from the fact that and turn out to be -adapted since they can be written as
[TABLE]
To prove the converse, let us observe that, under the probability measure , the process and solve the following stochastic differential equations
[TABLE]
[TABLE]
respectively. Clearly is -adapted. Recalling (3.16), the solution to equation (A.4) can be constructed iteratively. More precisely, , the process solves
[TABLE]
and for any time between two consecutive jump times, i.e. , , one has
[TABLE]
By Assumption 2.1, this sequence stochastic differential equations has a unique strong solution on any interval , and this in turn gives the unique strong solution to (A.4). Moreover, turns out to be -adapted.
Then, by applying Corollary III.4.3.1 in [30], we have that every -local martingale admits the decomposition
[TABLE]
where and are -adapted processes, and is an -predictable process indexed by , such that for all
[TABLE]
Let now be a -local martingale, then is a -local martingale, where
[TABLE]
Taking into account (A.3), we have that solves
[TABLE]
and by applying the product formula to , we easily obtain that
[TABLE]
To conclude, we thus only need to set
[TABLE]
∎
Proof of Theorem 3.6.
In order to derive the filtering equation solved by , we apply the innovation approach (see, for instance, Chapter IV in [4]). In this proof we shall use the two well-known facts:
- (i)
for every -martingale , the projection over is an -martingale; that is, , , is an -martingale;
- (ii)
for any -progressively measurable and integrable process we have that
[TABLE]
is an -martingale.
The first step of the innovation method consists in writing the process , , as a semimartingale. Denoting by the Markov generator of the state process , we have that
[TABLE]
where . Hence, for any , we can write
[TABLE]
where is an -martingale. By taking the conditional expectation with respect to , and using (i) and (ii) above, we obtain that
[TABLE]
where is an -martingale null at zero. Proposition 3.5 ensures the existence of processes and that are -predictable, and which is -predictable and indexed by , such that
[TABLE]
with , and , . To obtain equation (3.14) it only remains to prove that
[TABLE]
[TABLE]
with given in (3.15).
Following the same lines of the proof of Theorem 3.1 in [11] we can derive the structure of the processes , by imposing the following equalities
[TABLE]
where and are the -Brownian motions defined in (A.2). To derive the expression of , we consider a bounded process of the form , with -predictable process indexed by . Since is -adapted, the equality
[TABLE]
holds. By applying the product rule (taking into account no common jumps between and ) we obtain
[TABLE]
where is the -dual predictable projection of given in (3.12), and is an -martingale. By projection onto , and denoting by an -martingale, we have that
[TABLE]
On the other hand, the product rule and (A.5) and (A.6) yield
[TABLE]
Recalling that is the -dual predictable projection of given in (3.11), we find
[TABLE]
where, again, is an -martingale.
Gathering equations (A.7), (A.8), and (A.9), we obtain that for a.e.
[TABLE]
Choose now of the form , with any bounded, -predictable, positive process and . Observe that is bounded since , with a positive constant. Then the following equality holds on
[TABLE]
where we have set
[TABLE]
Thus, on ,
[TABLE]
Finally, since the counting process is nonexplosive, a.s. for , and this yields (3.15). ∎
Proof of Proposition 3.10.
By Proposition 3.7, equations (3.14) and (3.21) are equivalent to a system of recursive equations between consecutive jump times, i.e. for ,
[TABLE]
[TABLE]
where we have set
[TABLE]
[TABLE]
with the update at time given by
[TABLE]
Recall that, by assumption, the function given in (3.4) is locally-Lipschitz with respect to and satisfies a (global) sublinear growth condition with respect to , uniformly in .
We now develop the proof of uniqueness by distinguishing among three different cases related to the jumps’ amplitude ; namely, , , and .
In the case , we have that b^{\pi}(\underline{y},q,i):=\sum_{j=1}^{Q}\lambda_{ji}y_{j}-y_{i}\Big{[}\lambda^{N}(i)-\sum_{j=1}^{Q}\lambda^{N}(j)y_{j}\Big{]}, and it is easy to verify that between two consecutive jump times the pair solves a -dimensional stochastic differential equation with coefficients satisfying locally-Lipschitz conditions and (global) sublinear growth conditions with respect to , uniformly in . As a consequence, strong uniqueness holds between two consecutive jump times; i.e. for , . Moreover, since the update at jump time (see (A.10)) depends on the process for , we have strong uniqueness of the solution to system (3.14) and (3.21) for all .
In the case , equations (3.14) and (3.21) reduce to
[TABLE]
[TABLE]
where, in particular, . It is easy to check that also in this case strong uniqueness follows by the locally-Lipschitz property of the coefficients and by their (global) sublinear growth condition.
In the case the jumps’ amplitude can assume any possible real value. In particular, can be such that and do not have only common jumps: might jump at a time at which , so that does not jump at that time. The treatment of this case is more delicate and should be performed separately. Indeed, the uniqueness cannot be proved by using the arguments employed in the previous two cases because of the presence of in the coefficient which prevents to prove Lipschitz-continuity of with respect to . However, one might prove uniqueness by relying on the filtered martingale problem associated to the infinitesimal generator of the triplet . We refer to the seminal paper [36] and to Theorem 3.3 and Appendix B in the more recent [11].
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aronson, D.G. (1967). Bounds for the Fundamental Solution of a Parabolic Equation. Bull. Amer. Math. Soc. 73(6) pp. 890–896.
- 2[2] Baldursson, F.M., Karatzas, I. ( 1997 ) 1997 (1997) . Irreversible Investment and Industry Equilibrium. Finance Stoch. 1 pp. 69–89.
- 3[3] Borodin, A.N., Salminen, P. (2015). Handbook of Brownian Motion - Facts and Formulae (2nd edition). Birkhäuser.
- 4[4] Brémaud, P. (1980). Point Processes and Queues: Martingale Dynamics . Springer-Verlag.
- 5[5] Cadenillas, A., Huamán-Aguilar, R. ( 2016 ) 2016 (2016) . Explicit Formula for the Optimal Government Debt Ceiling. Ann. Oper. Res. 247(2) pp. 415–449.
- 6[6] Cadenillas, A., Huamán-Aguilar, R. (2018). On the Failure to Reach the Optimal Government Debt Ceiling. Forthcoming on Risks .
- 7[7] Ceci, C., Gerardi, A. (1998). Partially Observed Control of a Markov Jump Process with Counting Observations: Equivalence with the Separated Problem. Stoch. Process. Appl. 78 , pp. 245–260.
- 8[8] Ceci, C., Gerardi, A. (2000). Filtering of a Markov Jump Process with Counting Observations. Appl. Math. Optim. 42(1) , pp. 1–18.
