Optimal Reduction of Public Debt under Partial Observation of the   Economic Growth

Giorgia Callegaro; Claudia Ceci; Giorgio Ferrari

arXiv:1901.08356·math.OC·January 29, 2019

Optimal Reduction of Public Debt under Partial Observation of the Economic Growth

Giorgia Callegaro, Claudia Ceci, Giorgio Ferrari

PDF

TL;DR

This paper models optimal public debt reduction considering partial economic information, using stochastic control and filtering techniques, and provides explicit policies in a two-regime economic setting.

Contribution

It formulates a novel partial observation control model with regime-switching and derives explicit optimal policies in a simplified two-regime case.

Findings

01

Reduction of the complex control problem to a full information problem via filtering.

02

Explicit characterization of optimal debt reduction policy in a two-regime economy.

03

Development of a free boundary in the associated optimal stopping problem.

Abstract

We consider a government that aims at reducing the debt-to-gross domestic product (GDP) ratio of a country. The government observes the level of the debt-to-GDP ratio and an indicator of the state of the economy, but does not directly observe the development of the underlying macroeconomic conditions. The government's criterion is to minimize the sum of the total expected costs of holding debt and of debt's reduction policies. We model this problem as a singular stochastic control problem under partial observation. The contribution of the paper is twofold. Firstly, we provide a general formulation of the model in which the level of debt-to-GDP ratio and the value of the macroeconomic indicator evolve as a diffusion and a jump-diffusion, respectively, with coefficients depending on the regimes of the economy. These are described through a finite-state continuous-time Markov chain. We…

Equations408

dX^{0}_{t}=\big{(}r-g(Z_{t})\big{)}X^{0}_{t}dt+\sigma X^{0}_{t}dW_{t},\quad X^{0}_{0}=x\in(0,\infty),

dX^{0}_{t}=\big{(}r-g(Z_{t})\big{)}X^{0}_{t}dt+\sigma X^{0}_{t}dW_{t},\quad X^{0}_{0}=x\in(0,\infty),

d η_{t} = b_{1} (η_{t}, Z_{t}) d t + σ_{1} (η_{t}) d W_{t} + σ_{2} (η_{t}) d B_{t} + c (η_{t^{-}}, Z_{t^{-}}) d N_{t}, η_{0} = q \in I,

d η_{t} = b_{1} (η_{t}, Z_{t}) d t + σ_{1} (η_{t}) d W_{t} + σ_{2} (η_{t}) d B_{t} + c (η_{t^{-}}, Z_{t^{-}}) d N_{t}, η_{0} = q \in I,

∣ b_{1} (q, i) - b_{1} (q^{'}, i) ∣ + ∣ σ_{1} (q) - σ_{1} (q^{'}) ∣ + ∣ σ_{2} (q) - σ_{2} (q^{'}) ∣ + ∣ c (q, i) - c (q^{'}, i) ∣ \leq L_{R} ∣ q - q^{'} ∣;

∣ b_{1} (q, i) - b_{1} (q^{'}, i) ∣ + ∣ σ_{1} (q) - σ_{1} (q^{'}) ∣ + ∣ σ_{2} (q) - σ_{2} (q^{'}) ∣ + ∣ c (q, i) - c (q^{'}, i) ∣ \leq L_{R} ∣ q - q^{'} ∣;

∣ b_{1} (q, i) ∣^{2} + ∣ σ_{1} (q) ∣^{2} + ∣ σ_{2} (q) ∣^{2} + ∣ c (q, i) ∣^{2} \leq C (1 + ∣ q ∣^{2}) .

∣ b_{1} (q, i) ∣^{2} + ∣ σ_{1} (q) ∣^{2} + ∣ σ_{2} (q) ∣^{2} + ∣ c (q, i) ∣^{2} \leq C (1 + ∣ q ∣^{2}) .

H := F^{X^{0}} \lor F^{η},

H := F^{X^{0}} \lor F^{η},

H \subset F .

H \subset F .

dX^{\nu}_{t}=\big{(}r-g(Z_{t})\big{)}X^{\nu}_{t}dt+\sigma X^{\nu}_{t}dW_{t}-d\nu_{t},\quad X^{\nu}_{0^{-}}=x>0.

dX^{\nu}_{t}=\big{(}r-g(Z_{t})\big{)}X^{\nu}_{t}dt+\sigma X^{\nu}_{t}dW_{t}-d\nu_{t},\quad X^{\nu}_{0^{-}}=x>0.

\displaystyle\mathcal{M}(x,\underline{y},q):=\Big{\{}\nu:\Omega\times\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}:{(\nu_{t}(\omega):=\nu(\omega,t))}_{t\geq 0}\ \textrm{is nondecreasing, right-continuous,}

\displaystyle\mathcal{M}(x,\underline{y},q):=\Big{\{}\nu:\Omega\times\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}:{(\nu_{t}(\omega):=\nu(\omega,t))}_{t\geq 0}\ \textrm{is nondecreasing, right-continuous,}

\displaystyle\hskip 8.5359pt\mathbb{H}-\textrm{adapted, such that}\,X_{t}^{\nu}\geq 0\ \textrm{for every}\ t\geq 0,\ X^{\nu}_{0^{-}}=x,\,\,\mathsf{P}(Z_{0}=i)=y_{i},\,\,i\in S,\,\,\eta_{0}=q\ \textrm{a.s.}\Big{\}},

\mathcal{Y}:=\Big{\{}{\underline{y}}=(y_{1},\dots y_{Q}):y_{i}\in[0,1],\,\,\,i=1,\dots Q,\,\,\,\sum_{i=1}^{Q}y_{i}=1\Big{\}},

\mathcal{Y}:=\Big{\{}{\underline{y}}=(y_{1},\dots y_{Q}):y_{i}\in[0,1],\,\,\,i=1,\dots Q,\,\,\,\sum_{i=1}^{Q}y_{i}=1\Big{\}},

X_{t}^{x,\nu}=X^{1,0}_{t}\bigg{[}x-\int_{0}^{t}\frac{d\nu_{s}}{X^{1,0}_{s}}\bigg{]},\qquad t\geq 0,\quad X_{0^{-}}^{x,\nu}=x,

X_{t}^{x,\nu}=X^{1,0}_{t}\bigg{[}x-\int_{0}^{t}\frac{d\nu_{s}}{X^{1,0}_{s}}\bigg{]},\qquad t\geq 0,\quad X_{0^{-}}^{x,\nu}=x,

X_{t}^{1, 0} = e^{\int_{0}^{t} (r - g (Z_{s})) d s - \frac{1}{2} σ^{2} t + σ W_{t}}, t \geq 0.

X_{t}^{1, 0} = e^{\int_{0}^{t} (r - g (Z_{s})) d s - \frac{1}{2} σ^{2} t + σ W_{t}}, t \geq 0.

\left\{\begin{array}[]{rcl}dD_{t}&=&rD_{t}dt-d\xi_{t},\qquad\quad\,\,\quad D_{0^{-}}=d>0,\\ dY_{t}&=&g(Z_{t})Y_{t}dt+\sigma Y_{t}dW_{t},\quad Y_{0}=y>0,\\ \end{array}\right.

\left\{\begin{array}[]{rcl}dD_{t}&=&rD_{t}dt-d\xi_{t},\qquad\quad\,\,\quad D_{0^{-}}=d>0,\\ dY_{t}&=&g(Z_{t})Y_{t}dt+\sigma Y_{t}dW_{t},\quad Y_{0}=y>0,\\ \end{array}\right.

\mathsf{E}\bigg{[}\int_{0}^{\infty}e^{-\rho t}h\big{(}X_{t}^{x,0},i\big{)}dt\bigg{]}+\mathsf{E}\bigg{[}\int_{0}^{\infty}e^{-\rho t}X_{t}^{1,0}h_{x}\big{(}X_{t}^{x,0},i\big{)}dt\bigg{]}<\infty.

\mathsf{E}\bigg{[}\int_{0}^{\infty}e^{-\rho t}h\big{(}X_{t}^{x,0},i\big{)}dt\bigg{]}+\mathsf{E}\bigg{[}\int_{0}^{\infty}e^{-\rho t}X_{t}^{1,0}h_{x}\big{(}X_{t}^{x,0},i\big{)}dt\bigg{]}<\infty.

\mathcal{J}_{x,\underline{y},q}(\nu):=\mathsf{E}_{(x,\underline{y},q)}\bigg{[}\int_{0}^{\infty}e^{-\rho t}h\big{(}X_{t}^{x,\nu},Z_{t}\big{)}dt+\int_{0}^{\infty}e^{-\rho t}d\nu_{t}\bigg{]},\quad\nu\in\mathcal{M}(x,\underline{y},q).

\mathcal{J}_{x,\underline{y},q}(\nu):=\mathsf{E}_{(x,\underline{y},q)}\bigg{[}\int_{0}^{\infty}e^{-\rho t}h\big{(}X_{t}^{x,\nu},Z_{t}\big{)}dt+\int_{0}^{\infty}e^{-\rho t}d\nu_{t}\bigg{]},\quad\nu\in\mathcal{M}(x,\underline{y},q).

(P1) V_{p o} (x, \underline{y}, q) := ν \in M (x, \underline{y}, q) in f J_{x, \underline{y}, q} (ν), (x, \underline{y}, q) \in (0, \infty) \times Y \times I .

(P1) V_{p o} (x, \underline{y}, q) := ν \in M (x, \underline{y}, q) in f J_{x, \underline{y}, q} (ν), (x, \underline{y}, q) \in (0, \infty) \times Y \times I .

\pi_{t}(f):=\mathsf{E}\big{[}f(Z_{t})\big{|}\mathcal{H}_{t}\big{]},

\pi_{t}(f):=\mathsf{E}\big{[}f(Z_{t})\big{|}\mathcal{H}_{t}\big{]},

π_{t} (f_{i}) = P (Z_{t} = i ∣ H_{t}), i \in S,

π_{t} (f_{i}) = P (Z_{t} = i ∣ H_{t}), i \in S,

I_{t}:=W_{t}-\int_{0}^{t}\sigma^{-1}\big{(}\pi_{s}(\beta)-\beta(Z_{s})\big{)}ds,\quad I^{1}_{t}:=B_{t}-\int_{0}^{t}\big{(}\pi_{s}(\alpha(\eta_{s},\,\cdot\,))-\alpha(\eta_{s},Z_{s})\big{)}ds,

I_{t}:=W_{t}-\int_{0}^{t}\sigma^{-1}\big{(}\pi_{s}(\beta)-\beta(Z_{s})\big{)}ds,\quad I^{1}_{t}:=B_{t}-\int_{0}^{t}\big{(}\pi_{s}(\alpha(\eta_{s},\,\cdot\,))-\alpha(\eta_{s},Z_{s})\big{)}ds,

\alpha(q,i):=\sigma_{2}(q)^{-1}\Big{\{}b_{1}(q,i)-\sigma^{-1}\beta(i)\sigma_{1}(q)\Big{\}},\quad(q,i)\in\mathcal{I}\times S.

\alpha(q,i):=\sigma_{2}(q)^{-1}\Big{\{}b_{1}(q,i)-\sigma^{-1}\beta(i)\sigma_{1}(q)\Big{\}},\quad(q,i)\in\mathcal{I}\times S.

E [e^{\frac{1}{2} \int_{0}^{t} α^{2} (η_{s}, Z_{s}) d s}] < \infty, for any t \geq 0.

E [e^{\frac{1}{2} \int_{0}^{t} α^{2} (η_{s}, Z_{s}) d s}] < \infty, for any t \geq 0.

m (d t, d q) := s : Δ η_{s} \neq = 0 \sum δ_{(s, Δ η_{s})} (d s, d q),

m (d t, d q) := s : Δ η_{s} \neq = 0 \sum δ_{(s, Δ η_{s})} (d s, d q),

\int_{0}^{t} c (η_{s^{-}}, Z_{s^{-}}) \mathds 1_{{c (η_{s^{-}}, Z_{s^{-}}) \neq = 0}} d N_{s} = \int_{0}^{t} \int_{R} q m (d s, d q), t > 0.

\int_{0}^{t} c (η_{s^{-}}, Z_{s^{-}}) \mathds 1_{{c (η_{s^{-}}, Z_{s^{-}}) \neq = 0}} d N_{s} = \int_{0}^{t} \int_{R} q m (d s, d q), t > 0.

F_{t}^{m} := σ {m ((0, s] \times A) : 0 \leq s \leq t, A \in B (R)},

F_{t}^{m} := σ {m ((0, s] \times A) : 0 \leq s \leq t, A \in B (R)},

\mathsf{E}\bigg{[}\int_{0}^{\infty}\int_{\mathbb{R}}\Phi(s,q)m(ds,dq)\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{\infty}\int_{\mathbb{R}}\Phi(s,q)m^{p,\mathbb{G}}(dt,dq)\bigg{]}.

\mathsf{E}\bigg{[}\int_{0}^{\infty}\int_{\mathbb{R}}\Phi(s,q)m(ds,dq)\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{\infty}\int_{\mathbb{R}}\Phi(s,q)m^{p,\mathbb{G}}(dt,dq)\bigg{]}.

m^{π} (d t, d q) := m (d t, d q) - m^{p, H} (d t, d q),

m^{π} (d t, d q) := m (d t, d q) - m^{p, H} (d t, d q),

m^{p, H} (d t, d q) = i = 1 \sum Q π_{t^{-}} (i) λ^{N} (i) \mathds 1_{{c (η_{t^{-}}, i) \neq = 0}} δ_{c (η_{t^{-}}, i)} (d q) d t,

m^{p, H} (d t, d q) = i = 1 \sum Q π_{t^{-}} (i) λ^{N} (i) \mathds 1_{{c (η_{t^{-}}, i) \neq = 0}} δ_{c (η_{t^{-}}, i)} (d q) d t,

m^{p, F} (d t, d q) := λ^{N} (Z_{t^{-}}) \mathds 1_{{c (η_{t^{-}}, Z_{t^{-}}) \neq = 0}} δ_{c (η_{t^{-}}, Z_{t^{-}})} (d q) d t .

m^{p, F} (d t, d q) := λ^{N} (Z_{t^{-}}) \mathds 1_{{c (η_{t^{-}}, Z_{t^{-}}) \neq = 0}} δ_{c (η_{t^{-}}, Z_{t^{-}})} (d q) d t .

N_{t} (A) := m ((0, t] \times A) = s \leq t \sum \mathds 1_{{Δ η_{s} \in A ∖ {0}}}, t \geq 0.

N_{t} (A) := m ((0, t] \times A) = s \leq t \sum \mathds 1_{{Δ η_{s} \in A ∖ {0}}}, t \geq 0.

\displaystyle\mathsf{E}\bigg{[}\int_{0}^{t}\!\!C_{s}\,d\mathcal{N}_{s}(A)\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{t}\!\!\!C_{s}\mathds{1}_{\{c(\eta_{s^{-}},Z_{s^{-}})\in A\setminus\{0\}\}}dN_{s}\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{t}\!\!\!C_{s}\mathds{1}_{\{c(\eta_{s^{-}},Z_{s^{-}})\in A\setminus\{0\}\}}\lambda^{N}(Z_{s^{-}})ds\bigg{]}.

\displaystyle\mathsf{E}\bigg{[}\int_{0}^{t}\!\!C_{s}\,d\mathcal{N}_{s}(A)\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{t}\!\!\!C_{s}\mathds{1}_{\{c(\eta_{s^{-}},Z_{s^{-}})\in A\setminus\{0\}\}}dN_{s}\bigg{]}=\mathsf{E}\bigg{[}\int_{0}^{t}\!\!\!C_{s}\mathds{1}_{\{c(\eta_{s^{-}},Z_{s^{-}})\in A\setminus\{0\}\}}\lambda^{N}(Z_{s^{-}})ds\bigg{]}.

π_{t^{-}} (λ^{N} (.) \mathds 1_{{c (η_{t^{-}}, .) \in A ∖ {0}}}) = i = 1 \sum Q π_{t^{-}} (i) λ^{N} (i) \mathds 1_{{c (η_{t^{-}}, i) \in A ∖ {0}}}, \forall A \in B (R) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Optimal Reduction of Public Debt under Partial Observation of the Economic Growth

Giorgia Callegaro, Claudia Ceci, and Giorgio Ferrari

G. Callegaro: Department of Mathematics “Tullio Levi-Civita”, University of Padova, Via Trieste, 35121 Padova, Italy

[email protected]

C. Ceci: Department of Economics, University “G. D’annunzio” of Chieti-Pescara, Viale Pindaro 42, I-65127 Pescara, Italy

[email protected]

G. Ferrari: Center for Mathematical Economics (IMW), Bielefeld University, Universitätsstrasse 25, 33615 Bielefeld, Germany

[email protected]

Abstract.

We consider a government that aims at reducing the debt-to-gross domestic product (GDP) ratio of a country. The government observes the level of the debt-to-GDP ratio and an indicator of the state of the economy, but does not directly observe the development of the underlying macroeconomic conditions. The government’s criterion is to minimize the sum of the total expected costs of holding debt and of debt’s reduction policies. We model this problem as a singular stochastic control problem under partial observation. The contribution of the paper is twofold. Firstly, we provide a general formulation of the model in which the level of debt-to-GDP ratio and the value of the macroeconomic indicator evolve as a diffusion and a jump-diffusion, respectively, with coefficients depending on the regimes of the economy. These are described through a finite-state continuous-time Markov chain. We reduce via filtering techniques the original problem to an equivalent one with full information (the so-called separated problem), and we provide a general verification result in terms of a related optimal stopping problem under full information. Secondly, we specialize to a case study in which the economy faces only two regimes, and the macroeconomic indicator has a suitable diffusive dynamics. In this setting we provide the optimal debt reduction policy. This is given in terms of the continuous free boundary arising in an auxiliary fully two-dimensional optimal stopping problem.

Keywords: singular stochastic control; partial observation; filtering; separated problem; optimal stopping; free boundary; debt-to-GDP ratio.

MSC2010 subject classification: 93E20, 60G35, 93E11, 60G40, 60J60, 91B64.

1. Introduction

The question of optimally managing debt-to-GDP ratio (also called “debt ratio”) of a country has become particularly important in the latest years. Indeed, concurrently with the financial crisis started in 2007, debt-to-GDP ratio exploded from an average of 53% to circa 80% in developed countries. Clearly, the debt management policy of a government highly depends on the underlying macroeconomic conditions; indeed, these affect, for example, the growth rate of GDP which, in turn, determines the growth rate of the debt-to-GDP ratio of a country. However, in practice it is typically neither possible to measure in real-time the growth rate of GDP, nor one can directly observe the underlying business cycles. On August 24, 2018, Jerome H. Powell, Chairman of the Federal Reserve, said:111Speech at “Changing Market Structure and Implications for Monetary Policy”, a symposium sponsored by the Federal Reserve Bank of Kansas City in Jackson Hole, Wyoming.

…In conventional models of the economy, major economic quantities such as inflation, unemployment, and the growth rate of gross domestic product fluctuate around values that are considered “normal”, or “natural” or “desired”. The FMOC (Federal Open Market Committee) has chosen a 2 percent inflation objective as one of these desired values. The other values are not directly observed, nor can they be chosen by anyone…

Following an idea that dates back to [28], in this paper we suppose that the GDP growth rate of a country is modulated by a continuous-time Markov chain that is not directly observable. The Markov chain has $Q\geq 2$ states modeling the different business cycles of the economy, so that a shift in the macroeconomic conditions induces a change in the value of the growth rate of GDP. The government can observe only the current levels of the debt-to-GDP ratio and of a macroeconomic indicator. The latter might be, e.g., one of the so-called “Big Four”222These indicators constitute the Conference Board’s Index of Coincident Indicators; they are employment in non agricultural businesses, industrial production, real personal income less transfers, and real manufacturing and trade sales. We refer to, e.g., [44], where the authors present a wide range of economic indicators and examine the forecasting performance of various of them in the recession of 2001., which are usually considered proxies of the industrial production index, hence of the business conditions.

The government may intervene in order to decrease the level of the debt ratio, e.g. through fiscal policies or imposing austerity policies in the form of spending cuts. We assume that the debt ratio is instantaneously affected by any such policy. Debt reductions must not necessarily be performed at rates, but also lump sum actions are allowed, and the cumulative amount of debt ratio’s decrease is the government’s control variable. Any decrease of the debt ratio results in proportional costs, and the government aims at choosing a debt-reduction policy that minimizes the total expected costs of holding debt, plus the total expected costs of interventions on the debt ratio. In line with recent papers on stochastic control methods for optimal debt management (see [5], [6], [23] and [24]), we model the previous problem as a singular stochastic control problem. However, differently to all the previous works, our problem is formulated in a partial observation setting, thus leading to a completely different mathematical analysis. In our model, the observations consist of the debt ratio and of the macroeconomic indicator. The debt ratio is a linearly controlled geometric Brownian motion, and its drift is given in terms of the GDP growth rate, which is modulated by the unobservable continuous-time Markov chain $Z$ . The macroeconomic indicator is a real-valued jump-diffusion which is correlated to the debt ratio process, and which has drift, and both intensity and jump sizes, depending on $Z$ .

Our Contributions. Our study of the optimal debt reduction problem is performed thought three main steps.

First of all, via advanced filtering techniques with mixed-type observations, we reduce the original problem to an equivalent problem under full information, the so-called separated problem. The filtering problem consists in characterizing the conditional distribution of the unobservable Markov chain $Z$ , at any time $t$ , given observations up to time $t$ . The case of diffusion observations has been widely studied in literature and textbook treatments can be found in [20], [32], and [37]. There are also known results for pure-jump observations (see, e.g., [4], [8], [9], [35], and references therein). More recently, filtering problems with mixed-type information, which involve pure-jump processes and diffusions, have been studied in [11], [12], and [25]. Due to the structure of our observations’ dynamics we cannot apply the probability reference method (see [12] and [46]), and for this reason we choose an alternative route based on the innovation approach, which leads to the Kushner-Stratonovich equation. Moreover, differently to [11] and [25], in our framework the innovation process is two-dimensional and, therefore, the innovation method employed in these papers must be suitably adapted to our context. By showing that the Kushner-Stratonovich equation admits a unique strong solution, we are then able to prove that the original problem under partial observation and the separated problem are equivalent in the sense that they share the same value and the same optimal control.

Secondly, we exploit the convex structure of the separated problem, and we provide a general probabilistic verification theorem. This result - which is in line with findings in [2], [15] and [23], among others - relates the optimal control process to the solution to an auxiliary optimal stopping problem. Moreover, it proves that the value function of the separated problem is the integral - with respect to the controlled state variable - of the value function of the optimal stopping problem. The stopping problem thus gives the optimal timing at which one additional unit of debt should be reduced.

Finally, by specifying a setting in which the continuous-time Markov chain faces only two regimes (a fast growth or slow growth phase) and the macroeconomic indicator is a suitable diffusion process, we are able to characterize the optimal debt reduction policy. In this framework, the filter process is a two-dimensional process $(\pi_{t},1-\pi_{t})_{t\geq 0}$ , where $\pi_{t}$ is the conditional probability at time $t$ that the economy enjoys the fast growth phase. We prove that the optimal control prescribes to keep at any time the debt ratio below an endogenously determined curve that is a function of government’s belief about the current state of the economy. Such a debt ceiling is the free boundary of the fully two-dimensional optimal stopping problem that is related to the separated problem in the sense of the previously discussed verification theorem. By using almost exclusively probabilistic means, we are able to show that the value function of the auxiliary optimal stopping problem is a $C^{1}$ -function of its arguments, and thus enjoys the so-called smooth-fit property. Moreover, the free boundary is a continuous, bounded, and increasing function of the filter process. This last monotonicity property has also a clear economic interpretation: the more the government believes that the economy enjoys a regime of fast growth, the less strict the optimal debt reduction policy should be.

As a remarkable byproduct of the regularity of the value function of the optimal stopping problem, we also obtain that the value function of the singular stochastic control problem is a classical solution to its associated Hamilton-Jacobi-Bellman (HJB) equation. The latter takes the form of a variational inequality involving an elliptic second-order partial differential equation (PDE). It is worth noticing that the $C^{2}$ regularity of the value function implies the validity of a second-order principle of smooth fit, usually observed in one-dimensional problems.

We believe that the study of the auxiliary fully two-dimensional optimal stopping problem is a valuable contribution to the literature on its own. Indeed, if the literature on one-dimensional optimal stopping problems is very rich, the problem of characterizing the optimal stopping rule in multi-dimensional settings has been so far rarely explored in the literature (see the recent [13], [15] and [31], among the very few papers dealing with multi-dimensional stopping problems). This discrepancy is due to the fact that a standard guess-and-verify approach, based on the construction of an explicit solution to the variational inequality arising in the considered optimal stopping problem, is not anymore applicable in multi-dimensional settings where the variational inequality involves a PDE rather than an ordinary differential equation.

Related Literature. As already noticed above, our paper is placed among those recent works addressing the problem of optimal debt management via continuous-time stochastic control techniques. In particular, [5] and [6] model an optimal debt reduction problem as a one-dimensional control problem with singular and bounded-velocity controls, respectively. In [24] the government is allowed to increase and decrease the current level of debt ratio, and the interest rate on debt is modulated by a continuous-time observable Markov chain. The mathematical formulation leads to a one-dimensional bounded-variation stochastic control problem with regime switching. In [23], when optimally reducing the debt ratio, the government takes into consideration the evolution of the inflation rate of the country. The latter evolves as an uncontrolled diffusion process and affects the growth rate of the debt ratio, which is a process of bounded variation. In this setting, the debt reduction problem is formulated as a two-dimensional singular stochastic control problem whose HJB equation involves a second-order linear parabolic partial differential equation. All the previous papers are formulated in a full information setting, while ours is under partial observation.

The literature on singular stochastic control problems under partial observation is also still quite limited. Theoretical results on the PDE characterization of the value function of a two-dimensional optimal correction problem under partial observation are obtained in [38], whereas a general maximum principle for a not necessarily Markovian singular stochastic control problem under partial information has been more recently derived in [39]. We also refer to [14] and [19], where it is provided a thorough study of the optimal dividend strategy in models in which the surplus process evolves as drifted Brownian motion with unknown drift that can take only two constant values, with given probability.

Outline of the Paper. The rest of the paper is organized as follows. In Section 2 we introduce the setting and formulate the problem. The reduction of the problem under partial observation to the separated problem is performed in Section 3; in particular, the filtering results are presented in Section 3.1. The probabilistic verification theorem connecting the separated problem to one of optimal stopping is then proved in Section 3.3. In Section 4 we then consider a case study in which the economy faces only two regimes. Its solution, presented in Sections 4.2 and 4.3, hinges on the study of a two-dimensional optimal stopping problem that is performed in Section 4.1. Finally, Appendix A collects the proofs of some technical filtering results.

2. Setting and Problem Formulation

2.1. The Setting

Consider the complete filtered probability space $(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\geq 0},\mathsf{P})$ , capturing all the uncertainty of our setting. Here, $\mathbb{F}:={(\mathcal{F}_{t})}_{t\geq 0}$ denotes the full information filtration. We suppose that such a filtration satisfies the usual hypotheses of completeness and right-continuity.

We denote by $Z$ a continuous-time finite-state Markov chain describing the different states of the economy. For $Q\geq 2$ , let $S:=\{1,2,\dots,Q\}$ be the state space of $Z$ and $\{\lambda_{ij}\}_{1\leq i,j\leq Q}$ its generator matrix. Here, $\lambda_{ij}$ , $i\neq j$ , gives the intensity of a transition from state $i$ to state $j$ , and it is such that $\lambda_{ij}\geq 0$ , for $i\neq j$ , and $\sum_{j=1,j\neq i}^{Q}\lambda_{ij}=-\lambda_{ii}$ . For any time $t\geq 0$ , $Z_{t}$ is $\mathcal{F}_{t}$ -measurable.

In absence of any intervention by the government, we assume that the (uncontrolled) debt-to-GDP ratio evolves as

[TABLE]

where $W$ is a standard $\mathbb{F}$ -Brownian motion on $(\Omega,\mathcal{F})$ independent of $Z$ , $r\geq 0$ and $\sigma>0$ are constants, and $g:S\rightarrow\mathbb{R}$ . The constant $r$ is the real interest rate on debt, $\sigma$ is the debt’s volatility, and $g(i)\in\mathbb{R}$ is the rate of the GDP’s growth when the economy is in state $i\in S$ .

It is clear that equation (2.1) admits a unique strong solution, and, when needed, for any $x>0$ we shall denote it by $X^{x,0}$ . The current level of the debt-to-GDP ratio is known to the government at any time $t$ , and $X^{x,0}$ is therefore the first component of the so-called observation process.

The government also observes a macroeconomic stochastic indicator $\eta$ , e.g. one of the so-called “Big Four”, which we interpret as a proxy of the business conditions. We assume that $\eta$ is a jump-diffusion process solving the stochastic differential equation

[TABLE]

where $b_{1}$ , $c$ , $\sigma_{1}>0$ , and $\sigma_{2}>0$ are measurable functions of their arguments, and $\mathcal{I}\subseteq\mathbb{R}$ is the state space of $\eta$ . Here, $B$ is an $\mathbb{F}$ -standard Brownian motion, independent of $W$ and $Z$ . Moreover, $N$ is an $\mathbb{F}$ -adapted point process, without common jump times with $Z$ , independent of $W$ and $B$ . The predictable intensity of $N$ is denoted by $\{\lambda^{N}(Z_{t^{-}})\}_{t\geq 0}$ and depends on the current state of the economy, with $\lambda^{N}(\,\cdot\,)>0$ being a measurable function. From now on, we assume the following assumptions that ensure strong existence and uniqueness of the solution to equation (2.2) (see [26], among others).

Assumption 2.1.

The functions $b_{1}:\mathcal{I}\times S\to\mathbb{R}$ , $\sigma_{1}:\mathcal{I}\to(0,\infty)$ , $\sigma_{2}:\mathcal{I}\to(0,\infty)$ , and $c:\mathcal{I}\times S\to\mathbb{R}$ are such that for any $i\in S$ :

(i)

(Continuity) $b_{1}(\cdot,i)$ , $\sigma_{1}(\cdot)$ , $\sigma_{2}(\cdot)$ and $c(\cdot,i)$ are continuous;

(ii)

(Local Lipschitz conditions) for any $R>0$ , there exists a constant $L_{R}>0$ such that if $|q|<R,|q^{\prime}|<R$ , $q,q^{\prime}\in\mathcal{I}$ , then

[TABLE]

(iii)

(Growth conditions) there exists a constant $C>0$ such that

[TABLE]

The dynamics proposed in equation (2.2) is of a jump-diffusive type, and it allows for a size and intensity of the jumps affected by the state of the economy. It is therefore flexible enough to describe a large class of stochastic factors which may exhibit jumps.

The observation filtration $\mathbb{H}={(\mathcal{H}_{t})}_{t\geq 0}$ is defined as

[TABLE]

where $\mathbb{F}^{X^{0}}$ and $\mathbb{F}^{\eta}$ denote the natural filtrations generated by $X^{0}$ and $\eta$ , respectively, as usual augmented by $\mathsf{P}$ -null sets. Clearly, $(X^{0},\eta)$ is adapted to both $\mathbb{H}$ and $\mathbb{F}$ , and

[TABLE]

The above inclusion means that the government cannot directly observe the state of the economy $Z$ , but this has to be inferred through the observation of $(X^{0},\eta)$ . We are therefore working in a partial information setting.

2.2. The Optimal Debt Reduction Problem

The government can reduce the level of the debt-to-GDP ratio by intervening on the primary budget balance (i.e. the overall difference between government revenues and spending), for example through austerity policies in the form of spending cuts. By doing so the debt ratio dynamics modifies as

[TABLE]

The process $\nu$ is the control that the government chooses based on the information at its disposal. Precisely, $\nu_{t}$ defines the cumulative reduction of the debt-to-GDP ratio made by the government up to time $t$ , and $\nu$ is therefore a nondecreasing process belonging to the set

[TABLE]

for any given and fixed $x\in(0,\infty)$ initial value of $X^{\nu}$ , $q\in\mathcal{I}$ initial value of $\eta$ , and ${\underline{y}}\in\mathcal{Y}$ . Here

[TABLE]

is the probability simplex on $\mathbb{R}^{Q}$ , representing the space of initial distributions of the process $Z$ . From now on, we set $\nu_{0^{-}}=0$ a.s. for any $\nu\in\mathcal{M}(x,\underline{y},q)$ .

Remark 2.2.

Notice that in the definition of the set $\mathcal{M}$ above, as well as in (2.6) and in (P1) below, we have stressed the dependency on the initial data $(x,\underline{y},q)$ just for notational convenience, and not to stress any Markovian nature of the considered problem, which is in fact not such.

For any $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ and $\nu\in\mathcal{M}(x,\underline{y},q)$ , there exists a unique solution to (2.4), denoted by $X_{t}^{x,\nu}$ , that is given by

[TABLE]

where

[TABLE]

Here, and in the rest of this paper, we shall use the notation $\int_{0}^{t}(\,\cdot\,)d\nu_{s}=\int_{[0,t]}(\,\cdot\,)d\nu_{s}$ for the Lebesgue-Stieltjes integral with respect to the random measure $d\nu_{\cdot}$ induced by the nondecreasing process $\nu$ on $[0,\infty)$ .

Remark 2.3.

The dynamics (2.4) might be justified in the following way. Suppose that the public debt (in real terms), $D$ , and the GDP, $Y$ , follow the classical dynamics

[TABLE]

where $\xi_{t}$ is the cumulative real budget balance up to time $t$ . An easy application of Itô’s formula then gives that the ratio $X:=D/Y$ evolves as in (2.4), upon setting $d\nu:=d\xi/Y$ .

The government aims at reducing the level of the debt ratio. Having a level of debt ratio $X_{t}=x$ at time $t\geq 0$ when the state of the economy is $Z_{t}=i$ , the government incurs an instantaneous cost $h(x,i)$ . This cost may be interpreted as a measure of the resulting losses for the country due to the debt, as, e.g., a tendency to suffer low subsequent growth (see [22] and [45], among others). The cost function $h:\mathbb{R}\times S\mapsto\mathbb{R}_{+}$ fulfills the following requirements (see also [5] and [23])

Assumption 2.4.

(i)

For any $i\in S$ , the mapping $x\mapsto h(x,i)$ is strictly convex, continuously differentiable, and it is nondecreasing on $\mathbb{R}_{+}$ . Moreover, $h(0,i)=0$ ;

(ii)

For any given $x\in(0,\infty)$ and $i\in S$ one has

[TABLE]

A quadratic cost function of the form $h(x,i)=\frac{1}{2}\vartheta_{i}x^{2}$ , $(x,i)\in[0,\infty)\times S$ , $\vartheta_{i}>0$ , clearly satisfies Assumption 2.4 for a suitable $\rho>0$ .

Whenever the government intervenes in order to reduce the debt-to-GDP ratio, it incurs a proportional cost. We assume that the marginal cost of each intervention is normalized to one.

Given an intertemporal discount rate $\rho>0$ , for any given and fixed $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ , the government thus aims at minimizing the expected total cost functional

[TABLE]

Here $\mathsf{E}_{(x,\underline{y},q)}$ is the expectation under the condition that $X^{x,\nu}_{0^{-}}=x$ , $Z$ has initial distribution $\underline{y}$ , and $\eta_{0}=q$ . The government’s problem under partial observation can be therefore defined as

[TABLE]

One has that $V_{po}$ is well defined and finite. Indeed, it is nonnegative, due to the nonnegativity of $h$ ; moreover, since the admissible policy “instantaneously reduce at initial time the debt ratio to [math]” is a priori suboptimal and it has cost $x$ , then $V_{po}\leq x$ .

We would like to stress once more that any $\nu\in\mathcal{M}(x,\underline{y},q)$ is $\mathbb{H}$ -adapted, and therefore Problem (P1) is a stochastic optimal control problem under partial observation. In particular, it is a singular stochastic control problem under partial observation; that is, an optimal control problem in which the random measures induced by the nondecreasing control processes on $[0,\infty)$ might be singular with respect to the Lebesgue measure, and in which one component of the state variable, $Z$ , is not directly observable by the controller.

In its current formulation, the optimal debt reduction problem is not Markovian and it is therefore not directly solvable by standard means of stochastic control theory. In the next section, by using techniques from filtering theory, we will introduce an equivalent problem under complete information, the so-called separated problem. This will enjoy a Markovian structure and its solution will be characterized in Section 3.3 through a Markovian optimal stopping problem.

3. Reduction to an Equivalent Problem under Complete Information

In this section we derive the separated problem. To this end, we first study the filtering problem arising in our model. As already discussed in the introduction, results on such a filtering problem cannot be directly obtained from existing literature due to the structure of our dynamics.

3.1. The Filtering Problem

The filtering problem consists in finding the best-mean squared estimate of $f(Z_{t})$ , for any $t$ and any measurable function $f$ , on the basis of the information collected up to time $t$ . In our setting, such an information flow is given by the filtration $\mathbb{H}$ . That estimate can be described through the filter process ${(\pi_{t})}_{t\geq 0}$ , which provides the conditional distribution of $Z_{t}$ given $\mathcal{H}_{t}$ , for any time $t$ (see, for instance, [37]). It is defined as the $\mathbb{H}$ -càdlàg (right-continuous with left limits) process taking values in the space of probability measures on $S=\{1,\dots,Q\}$ such that

[TABLE]

for all measurable functions $f$ on $S$ . Since $Z$ takes only a finite number of values, the filter is completely described by the vector

[TABLE]

where $f_{i}(z):=\mathds{1}_{\{z=i\}}$ , $i\in S$ . With a slight abuse of notation, in the following we will denote by $\pi(i)$ the process $\pi(f_{i})$ , so that for all measurable functions $f$ we have from (3.1), $\pi_{t}(f)=\sum_{i=1}^{Q}f(i)\pi_{t}(i).$

Setting $\beta(Z_{t}):=r-g(Z_{t})$ and, accordingly, $\beta_{i}:=r-g(i)$ , $i\in S$ , notice that $\beta$ is clearly a bounded function. Then, define the two processes $I$ and $I^{1}$ such that for any $t\geq 0$

[TABLE]

where

[TABLE]

Henceforth, we will work under the following Novikov’s condition.

Assumption 3.1.

[TABLE]

Under Assumption 3.1, by classical results from filtering theory (see, e.g., [37]), the innovation processes $I$ and $I^{1}$ are Brownian motions with respect to the filtration $\mathbb{H}$ . Moreover, given the assumed independence of $B$ and $W$ , they turn out to be independent.

The integer-valued random measure associated to the jumps of $\eta$ is defined as

[TABLE]

where $\delta_{(a_{1},a_{2})}$ denotes the Dirac measure at point $(a_{1},a_{2})\in\mathbb{R}_{+}\times\mathbb{R}$ . Notice that the $\mathbb{H}$ -adapted random measure $m$ is such that

[TABLE]

To proceed further we need the following useful definitions.

Definition 3.2.

( $\mathbb{G}$ -Predictable Process indexed by $\mathbb{R}$ ). Given any filtration $\mathbb{G}$ , let ${\mathcal{P}}(\mathbb{G})$ denote the predictable $\sigma$ -field on $(0,\infty)\times\Omega$ and ${\mathcal{B}}(\mathbb{R})$ the Borel $\sigma$ -algebra on $\mathbb{R}$ . Any mapping $H:(0,\infty)\times\Omega\times\mathbb{R}\to\mathbb{R}$ which is ${\mathcal{P}}(\mathbb{G})\times{\mathcal{B}}(\mathbb{R})$ -measurable is called $\mathbb{G}$ -predictable process indexed by $\mathbb{R}$ .

Letting

[TABLE]

we denote by $\mathbb{F}^{m}:=(\mathcal{F}^{m}_{t})_{t\geq 0}$ the filtration generated by the random measure $m(dt,dq)$ .

Definition 3.3.

(Dual Predictable Projection of $m$ ). Given any filtration $\mathbb{G}$ , such that $\mathbb{F}^{m}\subseteq\mathbb{G}$ , the $\mathbb{G}$ -dual predictable projection of $m$ , denoted by $m^{p,\mathbb{G}}(dt,dq)$ , is defined as the unique positive $\mathbb{G}$ -predictable random measure such that for any nonnegative, $\mathbb{G}$ -predictable process $\Phi$ indexed by $\mathbb{R}$

[TABLE]

To prove that a positive $\mathbb{G}$ -predictable random measure provides the $\mathbb{G}$ -dual predictable projection of $m$ it suffices to prove that equation (3.9) holds true for any process of the form $\Phi(t,q)=C_{t}\mathds{1}_{A}(q)$ with $C$ a nonnegative $\mathbb{G}$ -predictable process and $A\in{\mathcal{B}}(\mathbb{R})$ . For further details we refer to [4] and [29].

We now aim at deriving an equation for the evolution of the filter (the filtering equation). To this end we use the so-called innovation approach (see [4], [11], and [37], among others), which, in our setting, requires the introduction of the $\mathbb{H}$ -compensated jump measure of $\eta$

[TABLE]

where $m^{p,\mathbb{H}}(dt,dq)$ is the $\mathbb{H}$ -dual predictable projection of $m$ (cf. Definition 3.3 above). The triplet $(I,I^{1},m^{\pi})$ also represents the building block of the construction of $\mathbb{H}$ -martingales, as it is shown in Proposition 3.5 below. We start determining the form of $m^{p,\mathbb{H}}$ .

Proposition 3.4.

The $\mathbb{H}$ -dual predictable projection of $m$ is given by

[TABLE]

where $\delta_{a}$ denotes the Dirac measure at point $a\in\mathbb{R}$ .

Proof.

Step 1. First, we prove that the $\mathbb{F}$ -dual predictable projection of $m$ is given by

[TABLE]

Let $A\in{\mathcal{B}}(\mathbb{R})$ and introduce

[TABLE]

$\mathcal{N}(A)$ is the point process counting the number of jumps of $\eta$ up to time $t$ with jumps’ size in the set $A$ . Since by (2.2) one has that $\Delta\eta_{s}=c(\eta_{s^{-}},Z_{s^{-}})\mathds{1}_{\{c(\eta_{s^{-}},Z_{s^{-}})\neq 0\}}\Delta N_{s}$ , $\forall s\geq 0$ , and $N$ is a point process with $\mathbb{F}$ -predictable intensity given by $\{\lambda^{N}(Z_{t^{-}})\}_{t\geq 0}$ , we obtain that for each $C$ nonnegative $\mathbb{F}$ -predictable process

[TABLE]

That is, for any $A\in{\mathcal{B}}(\mathbb{R})$ , we have that $\{\lambda^{N}(Z_{t^{-}})\mathds{1}_{\{c(\eta_{t^{-}},Z_{t^{-}})\in A\setminus\{0\}\}}\}_{t\geq 0}$ provides the $\mathbb{F}$ -predictable intensity of the counting process $\mathcal{N}(A)$ . Recalling (3.13) and Definition 3.3, this implies that $m^{p,\mathbb{F}}(dt,dq)$ given in (3.12) coincides with the $\mathbb{F}$ -dual predictable projection of $m$ , since equation (3.9) holds with the choice $\mathbb{G}=\mathbb{F}$ and $\Phi(t,q)=C_{t}\mathds{1}_{A}(q)$ , with $C$ an arbitrary nonnegative $\mathbb{F}$ -predictable process and $A\in{\mathcal{B}}(\mathbb{R})$ .

Step 2. As in Proposition 2.3 in [10] we can now derive the $\mathbb{H}$ -dual predictable projection of $m^{p,\mathbb{F}}$ , denoted by $m^{p,\mathbb{H}}(dt,dq)$ , by simply projecting $m^{p,\mathbb{F}}$ with respect to the observation flow $\mathbb{H}$ . Precisely, we have that the $\mathbb{H}$ -predictable intensity of the point process $\mathcal{N}(A)$ , $\forall A\in{\mathcal{B}}(\mathbb{R})$ , is given by

[TABLE]

This implies that $m^{p,\mathbb{H}}(dt,dq)$ is given by (3.11), since (3.9) holds with the choice $\mathbb{G}=\mathbb{H}$ , $\Phi(t,q)=C_{t}\mathds{1}_{A}(q)$ , with $A\in{\mathcal{B}}(\mathbb{R})$ and $C$ an arbitrary nonnegative $\mathbb{H}$ -predictable process. ∎

An essential tool to prove that the original problem under partial information is equivalent to the separated one is the characterization of the filter as the unique solution to the filtering equation (see [7] and [21]). In order to derive the filtering equation solved by $\pi$ we first give a representation theorem for $\mathbb{H}$ -martingales. The proof of the following technical result is given in Appendix A.

Proposition 3.5.

Under Assumptions 2.1 and 3.1, every $\mathbb{H}$ -local martingale $M$ admits the decomposition

[TABLE]

where $\varphi$ and $\psi$ are $\mathbb{H}$ -adapted processes, and $w$ is an $\mathbb{H}$ -predictable process indexed by $\mathbb{R}$ such that a.s.

[TABLE]

We are now in the position to prove the following fundamental result, whose proof is postponed to Appendix A.

Theorem 3.6.

Recall (3.10), let ${\underline{y}}\in\mathcal{Y}$ be the initial distribution of $Z$ , and let Assumptions 2.1 and 3.1 hold. Then the filter ${\underline{\pi}}_{t}:=(\pi_{t}(i);i\in S)_{t\geq 0}$ solves the Kushner-Stratonovich system

[TABLE]

for any $i\in S$ . Here, $\beta_{i}=r-g(i)$ and

[TABLE]

denotes the Radon-Nikodym derivative of the measure $\lambda^{N}(i)\pi_{s^{-}}(i)\mathds{1}_{\{c(\eta_{s^{-}},i)\neq 0\}}\delta_{c(\eta_{s^{-}},i)}(dq)$ with respect to $\sum_{j=1}^{Q}\pi_{s^{-}}(j)\lambda^{N}(j)\mathds{1}_{\{c(\eta_{s^{-}},j)\neq 0\}}\delta_{c(\eta_{s^{-}},j)}(dq)$ .

Let us introduce the sequence of jump times and jump sizes of the process $\eta$ , denoted by $\{T_{n},\zeta_{n}\}_{n\geq 1}$ , and recursively defined as $T_{1}:=\inf\{t>0:\int_{0}^{t}c(\eta_{s^{-}},Z_{s^{-}})dN_{s}\neq 0\}$ ,

[TABLE]

In the definitions above we use the standard convention that $\inf\emptyset=+\infty$ .

Then the integer-valued measure associated to the jumps of $\eta$ (cf. (3.6)) can also be written as

[TABLE]

The filtering system of equations (3.14) has a natural recursive structure in terms of the sequence $\{T_{n}\}_{n\geq 1}$ , as it is shown in the next proposition.

Proposition 3.7.

Between two consecutive jump times, $t\in[T_{n},T_{n+1})$ , the filtering system of equations (3.14) reads as

[TABLE]

for any $i\in S$ . At a jump time of $\eta$ , say $T_{n}$ , ${\underline{\pi}}_{t}=(\pi_{t}(i);i\in S)_{t\geq 0}$ jumps as well, and its value is given by

[TABLE]

Proof.

First, recalling that $m^{\pi}(dt,dq)=m(dt,dq)-m^{p,\mathbb{H}}(dt,dq),$ and that

[TABLE]

we obtain that

[TABLE]

which, from (3.14), implies that for any $t\in[T_{n},T_{n+1})$ , $\pi_{t}(i)$ solves equation (3.7).

Finally, equation (3.18) follows by (3.15) and

[TABLE]

∎

We want to stress that equation (3.18) shows that the vector ${\underline{\pi}}_{T_{n}}$ is completely determined by the observed data $\eta$ and by the knowledge of ${\underline{\pi}}_{t}$ for $t\in[T_{n-1},T_{n})$ , since $\pi_{T_{n}^{-}}(i):=\lim_{t\uparrow T_{n}}\pi_{t}(i)$ , $i\in S$ .

Remark 3.8.

A few comments on the filtering equation are worth being done.

(1)

In the case $c(q,i)\equiv c\neq 0$ , for any $i\in S$ and $q\in\mathcal{I}$ , the sequences of jump times of $\eta$ and $N$ coincide, and the filtering system of equations (3.14) reduces to the simpler

[TABLE] 2. (2)

In the case $\alpha(q,i)=\alpha(i)$ and $c(q,i)\equiv 0$ , for any $i\in S$ and $q\in\mathcal{I}$ , the filtering system of equations (3.14) does not depend anymore explicitly on the process $\eta$ . In particular, one has

[TABLE]

where we have set $\alpha_{i}:=\sigma_{2}^{-1}\big{\{}b_{1}(i)-\sigma^{-1}\beta_{i}\sigma_{1}\big{\}}$ . With reference to (2.2) and (3.4), this setting corresponds, e.g., to the purely diffusive arithmetic case $c(q,i)=0$ , $b_{1}(q,i)=b_{1}(i)$ and $\sigma_{1}(q)=\sigma_{1}>0$ , $\sigma_{2}(q)=\sigma_{2}>0$ , for any $i\in S$ and $q\in\mathcal{I}$ , or to the purely diffusive geometric case $c(q,i)=0$ , $b_{1}(q,i)=b_{1}(i)q$ and $\sigma_{1}(q)=\sigma_{1}q$ , $\sigma_{2}(q)=\sigma_{2}q$ , for any $i\in S$ and $q\in\mathcal{I}$ . In Section 4 we will provide the explicit solution to the optimal debt reduction problem within this setting.

3.2. The Separated Problem

Thanks to the introduction of the filter, equations (2.1), (2.2), and (2.4) can now be rewritten in terms of observable processes. In particular, we have that

[TABLE]

and

[TABLE]

Notice that, for any $\nu\in\mathcal{M}(x,\underline{y},q)$ , the process $X^{\nu}$ turns out to be $\mathbb{H}$ -adapted, and depends on the vector ${\underline{\pi}}_{t}=(\pi_{t}(i);i\in S)_{t\geq 0}$ , such that ${\underline{\pi}}_{0}={\underline{y}}\in\mathcal{Y}$ .

Definition 3.9.

(Strong Uniqueness). We say that a process $({\underline{\widetilde{\pi}}}_{t},\widetilde{\eta}_{t})_{t\geq 0}$ with values in $\mathcal{Y}\times\mathcal{I}$ is a strong solution to equations (3.14) and (3.21) if it satisfies pathwise those equations. We say that strong uniqueness for the system of equations (3.14) and (3.21) holds if, for any $({\underline{\widetilde{\pi}}}_{t},\widetilde{\eta}_{t})_{t\geq 0}$ strong solution to system (3.14) and (3.21), one has ${\underline{\widetilde{\pi}}}_{t}={\underline{\pi}}_{t}$ , $\widetilde{\eta}_{t}=\eta_{t}$ , a.s. for all $t\geq 0$ .

Proposition 3.10.

Let Assumptions 2.1 and 3.1 hold, and suppose that $\alpha(\cdot,i)$ is locally-Lipschitz for any $i\in S$ , and there exists $M>0$ such that $|\alpha(q,i)|\leq M(1+|q|)$ , for any $q\in\mathcal{I}$ and any $i\in S$ . Then system (3.14) and (3.21) admits a unique strong solution.

Notice that, under Assumption 2.1, the requirement on $\alpha$ of Proposition 3.10 is verified, e.g., whenever $\sigma_{2}(q)\geq\kappa$ , for some $\kappa>0$ and for any $q\in\mathcal{I}$ , or if $b_{1}/\sigma_{2}$ and $\sigma_{1}/\sigma_{2}$ are locally-Lipschitz on $q\in\mathcal{I}$ and have sublinear growth. The proof of Proposition 3.10 is postponed to Appendix A. As a byproduct, it also ensures strong uniqueness of the solution to (3.22). In the following, when there will be the need to stress the dependence with respect to the initial value $x>0$ , we shall denote the solution to (3.20) and (3.22) by $X^{x,0}$ and $X^{x,\nu}$ , respectively.

Since

[TABLE]

an application of Fubini-Tonelli’s theorem allows to rewrite also the cost functional of (2.6) in terms of observable quantities as

[TABLE]

Here $\mathsf{E}_{(x,\underline{y},q)}$ denotes the expectation conditioned on $X^{\nu}_{0^{-}}=x>0$ , $\underline{\pi}_{0}=\underline{y}\in\mathcal{Y}$ , and $\eta_{0}=q\in\mathcal{I}$ . Notice that the latter expression does not depend anymore on the unobservable process $Z$ , and this allows us to introduce a control problem with complete information, the separated problem, in which the new state variable is given by the triplet $(X^{\nu},\underline{\pi},\eta)$ . For this problem we rewrite the set $\mathcal{M}(x,{\underline{y}},q)$ in terms of the observable processes given by (3.14), (3.21) and (3.22), and we denote by $\mathcal{A}(x,{\underline{y}},q)$ such a representation of the set $\mathcal{M}(x,{\underline{y}},q)$ ; that is,

[TABLE]

for every $x\in(0,\infty)$ initial value of $X^{x,\nu}$ defined in (3.22), for any ${\underline{y}}\in\mathcal{Y}$ initial values of the process ${\underline{\pi}}_{t}=(\pi_{t}(i);i\in S)_{t\geq 0}$ solution to equation (3.14), and for any $q\in\mathcal{I}$ initial value of $\eta$ . In the following, we set $\nu_{0^{-}}=0$ a.s. for any $\nu\in\mathcal{A}(x,{\underline{y}},q)$ .

Given $\nu\in\mathcal{A}(x,{\underline{y}},q)$ , the triplet $\{(X_{t}^{x,\nu},{\underline{\pi}}_{t},\eta_{t})\}_{t\geq 0}$ solves (3.22), (3.14) and (3.21) and the jump measure associated to $\eta$ has $\mathbb{H}$ -predictable dual projection given by equation (3.11). Hence, the process $\{(X_{t}^{x,\nu},{\underline{\pi}}_{t},\eta_{t})\}_{t\geq 0}$ is an $\mathbb{H}$ -Markov process and we therefore define the Markovian separated problem as

[TABLE]

This is now a singular stochastic problem under complete information, since all the processes involved are $\mathbb{H}$ -adapted.

The next proposition immediately follows from the previous construction of the separated problem, and from the strong uniqueness of the solutions to (3.14), (3.21), and (3.22).

Proposition 3.11.

Assume strong uniqueness for the system of equations (3.14) and (3.21), and let $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ be the initial values of the process $(X,Z,\eta)$ in the problem under partial observation (P1). Then

[TABLE]

Moreover, $\nu^{*}\in\mathcal{A}(x,\underline{y},q)$ is an optimal control for the separated problem (P2) if and only if $\nu^{*}\in\mathcal{M}(x,\underline{y},q)$ is an optimal control for the original problem under partial observation (P1).

Remark 3.12.

Notice that in the setting of Remark 3.8-(2), the pair $(X^{x,\nu},{\underline{\pi}})$ solving equations (3.22) and (3.14), respectively, is an $\mathbb{H}$ -Markov process, for any $\nu\in\mathcal{A}(x,\underline{y},q)$ , $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ . As a consequence, since the cost functional and the set of admissible controls do not depend explicitly on the process $\eta$ , the value function of the separated problem (P2) does not depend anymore on the variable $q$ . We will consider this setting as a case study in Section 4.

3.3. A Probabilistic Verification Theorem via Reduction to Optimal Stopping

In this section we relate the separated problem to a Markovian optimal stopping problem, and we show that the solution to the latter is directly related to the optimal control of the former. The following analysis is fully probabilistic and it is based on a change of variable formula for Lebesgue-Stieltjes integrals that has been already employed in singular control problems (see, e.g., [2] and [23]). The result of this section will then be employed in Section 4 where, in a case study, we determine the expression of the optimal debt reduction policy by solving an auxiliary optimal stopping problem.

With regard to Problem (P2), notice that $\pi_{t}\big{(}h(X_{t}^{x,\nu},\cdot)\big{)}=\sum_{i=1}^{Q}\pi_{t}(i)h(X_{t}^{x,\nu},i)$ a.s. for any $t\geq 0$ . Then, for any $(x,\underline{\pi})\in(0,\infty)\times\mathcal{Y}$ , set

[TABLE]

and, given $z\in(0,\infty)$ , we introduce the optimal stopping problem

[TABLE]

where the optimization is taken over all the $\mathbb{H}$ -stopping times $\tau\geq t$ .

Under Assumption 2.4, the expectation in (3.26) is finite for any $\mathbb{H}$ -stopping time $\tau\geq t$ , for any $t\geq 0$ . To take care of the event $\{\tau=\infty\}$ , in (3.26) we make use of the convention

[TABLE]

Denote by $U_{t}(z)$ a càdlàg modification of $\widetilde{U}_{t}(z)$ , and observe that $0\leq U_{t}(z)\leq X^{1,0}_{t}$ , for any $t\geq 0$ , a.s. Also, define the stopping time

[TABLE]

with the convention that $\tau_{t}^{*}(z)=\infty$ if the set on the right-hand side is empty. Then by Theorem D.12 in Appendix D of [34], $\tau_{t}^{*}(z)$ is an optimal stopping time for problem (3.26). In particular, $\tau^{*}(z):=\tau_{0}^{*}(z)$ is optimal for the problem

[TABLE]

Notice that since $h_{x}(\cdot,\underline{\pi})$ is a.s. increasing, then $z\mapsto\tau^{*}(z)$ is a.s. decreasing. Such monotonicity of $\tau^{*}(\,\cdot\,)$ will be important in the following as we will need to consider its generalized inverse. Moreover, since the triplet $(X^{z,0}_{t},\underline{\pi}_{t},\eta_{t})$ is an homogenous $\mathbb{H}$ -Markov process, there exists a measurable function $U:(0,\infty)\times\mathcal{Y}\times\mathcal{I}\to\mathbb{R}$ such that $U_{t}(z)=U(X^{z,0}_{t},\underline{\pi}_{t},\eta_{t})$ for any $t\geq 0$ , a.s. Hence, $U_{0}(z)=U(z,\underline{y},q)$ , and for any $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ , define

[TABLE]

Moreover, introduce the nondecreasing, right-continuous process

[TABLE]

and then also the process

[TABLE]

Notice that $\overline{\nu}^{*}_{\cdot}$ is the right-continuous inverse of $\tau^{*}(\,\cdot\,)$ .

Theorem 3.13.

Let $\widetilde{V}$ be as in (3.30) and $V$ as in the definition of Problem (P2). Then one has $\widetilde{V}=V$ , and $\nu^{*}$ is the (unique) optimal control for Problem (P2).

Proof.

Step 1. Let $x>0$ , $\underline{y}\in\mathcal{Y}$ , and $q\in\mathcal{I}$ be given and fixed. For $\nu\in\mathcal{A}(x,\underline{y},q)$ , we introduce the process $\overline{\nu}$ such that $\overline{\nu}_{t}:=\int_{0}^{t}\frac{d{\nu}_{s}}{X^{1,0}_{s}}$ , $t\geq 0$ , and define its inverse (see, e.g., Chapter 0, Section 4 of [43]) by

[TABLE]

Notice that the process $\tau^{\overline{\nu}}(z):=\{\tau^{\overline{\nu}}(z),\ z\leq x\}$ has decreasing, left-continuous sample paths, and hence it admits right-limits

[TABLE]

Moreover, the set of points $z\in\mathbb{R}$ at which $\tau^{\overline{\nu}}(z)(\omega)\neq\tau^{\overline{\nu}}_{+}(z)(\omega)$ is a.s. countable for a.e. $\omega\in\Omega$ .

The random time $\tau^{\overline{\nu}}(z)$ is actually an $(\mathcal{H}_{t})$ -stopping time because it is the entry time of an open set of the right-continuous process $\overline{\nu}$ , and $(\mathcal{H}_{t})_{t\geq 0}$ is right-continuous. Moreover, since $\tau^{\overline{\nu}}_{+}(z)$ is the first entry time of the right-continuous process $\overline{\nu}$ into a closed set, it is an $(\mathcal{H}_{t})$ -stopping time as well for any $z\leq x$ .

Proceeding then as in Step 1 of the proof of Theorem 3.1 in [23], by employing the change of variable formula in Chapter 0, Proposition 4.9 of [43], one finds that

[TABLE]

Hence, since $\nu$ was arbitrary, we find

[TABLE]

Step 2. To complete the proof we have to show the reverse inequality. Let $x\in(0,\infty)$ , $\underline{y}\in\mathcal{Y}$ , and $q\in\mathcal{I}$ , initial values of $X^{x,\nu}$ , $\underline{\pi}$ and $\eta$ . We first notice that $\nu^{*}\in\mathcal{A}(x,\underline{y},q)$ . Indeed, $\nu^{*}$ is nondecreasing, right-continuous and such that $X^{x,\nu^{*}}_{t}=X^{1,0}_{t}(x-\overline{\nu}^{*}_{t})\geq 0$ a.s. for all $t\geq 0$ , since one has by definition $\overline{\nu}^{*}_{t}\leq x$ a.s. Moreover, for any $0<z\leq x$ , we can write (cf. (3.31) and (3.34))

[TABLE]

Then, recalling that $\tau^{\overline{\nu}^{*}}_{+}(z)=\tau^{\overline{\nu}^{*}}(z)$ $\mathbb{P}$ -a.s. and for a.e. $z\leq x$ , we pick $\nu=\nu^{*}$ (equivalently, $\overline{\nu}=\overline{\nu}^{*}$ ), and following Step 2 in the proof of Theorem 3.1 of [23], we obtain $\widetilde{V}(x,\underline{y},q)=\mathcal{J}_{x,\underline{y},q}(\nu^{*})$ . That is, $\widetilde{V}=V$ by (3.35) and admissibility of $\nu^{*}$ . Therefore $\nu^{*}$ is optimal. In fact, $\nu^{*}$ is the unique optimal control in the class of controls belonging to $\mathcal{A}(x,\underline{y},q)$ and such that $\mathcal{J}_{x,\underline{y},q}(\nu)<\infty$ by strict convexity of $\mathcal{J}_{x,\underline{y},q}(\,\cdot\,)$ . ∎

Remark 3.14.

For any given $(x,\underline{y},q)\in(0,\infty)\times\mathcal{Y}\times\mathcal{I}$ , define the Markovian optimal stopping problem

[TABLE]

where $\mathsf{E}_{(x,\underline{y},q)}$ denotes the expectation under the probability measure $\mathsf{P}_{(x,\underline{y},q)}$ such that $\mathsf{P}(\,\cdot\,):=\mathsf{P}(\,\cdot\,|X^{x,0}_{0}=x,\underline{\pi}_{0}=\underline{y},\eta_{0}=q)$ . Then, it is readily verified that $v(x,\underline{y},q)=xU(x,\underline{y},q)$ . Moreover, it holds that the stopping time

[TABLE]

is optimal for $v(x,\underline{y},q)$ .

4. The Solution in a Case Study with $Q=2$ Economic Regimes

In this section, we build on the general filtering analysis developed in the previous sections and on the result of Theorem 3.13, and we provide the form of the optimal debt reduction policy in a case study that is defined through the following standing assumption.

Assumption 4.1.

(1)

$Z$ * takes values in $S=\{1,2\}$ , and, with reference to (2.4), we take $g_{2}:=g(2)<g(1)=:g_{1}$ ;* 2. (2)

for any $q\in\mathcal{I}$ and any $i\in\{1,2\}$ one has $c(q,i)=0$ and, for $\alpha$ as in (3.4), we take $\alpha(q,i)=\alpha(i)$ ; 3. (3)

$h(x,i)=h(x)$ * for all $(x,i)\in(0,\infty)\times\{1,2\}$ , with $h:\mathbb{R}\to\mathbb{R}$ such that:*

(i)

$x\mapsto h(x)$ * is strictly convex, twice-continuously differentiable, and nondecreasing on $\mathbb{R}_{+}$ with $h(0)=0$ and $\lim_{x\uparrow\infty}h(x)=\infty$ ;*

(ii)

there exist $\gamma>1$ , $0<K_{o}<K$ and $K_{1},K_{2}>0$ such that

[TABLE]

and

[TABLE]

Notice that under Assumption 4.1-(2) the macroeconomic indicator $\eta$ has a suitable diffusive dynamics whose coefficients $b_{1},\sigma_{1},\sigma_{2}$ are such that the function $\alpha$ is independent of $q$ . As discussed in Remark 3.8- $(2)$ , this is the case of a geometric or arithmetic diffusive dynamics for $\eta$ . In this setting the Kushner-Stratonovich system (3.14) reduces to

[TABLE]

and $\pi_{t}(2)=1-\pi_{t}(1)$ . Here, $\lambda_{1}:=\lambda_{12}>0$ and $\lambda_{2}:=\lambda_{21}>0$ .

Denoting by $\pi_{t}:=\pi_{t}(1)$ , $t\geq 0$ , problem (P2) then reads as

[TABLE]

where $g_{i}=r-\beta_{i}$ , denotes the rate of economic growth in the state $i$ , $i=1,2$ .

It is worth noticing that there is no need to involve the process $\eta$ in the Markovian formulation of problem (P3). This is due to the fact that the couple $(X^{\nu},\pi)$ , solving the two stochastic differential equations above is a strong Markov process, and the cost functional and the set of admissible controls (denoted by $\mathcal{A}(x,y)$ above) do not depend explicitly on $\eta$ . For this reason the value function of Problem (P3) does not depend on the initial value $q$ of the process $\eta$ . However, memory of the macroeconomic indicator process $\eta$ appears in the filter $\pi$ through the constant term $\alpha_{1}-\alpha_{2}$ in its dynamics.

Finally, we recall that, thanks to Propostion 3.11, by solving Problem (P3) we are also solving the original problem (P1). Indeed, we have that

[TABLE]

and a control is optimal for the separated problem (P3) if and only if it is such for the original problem under partial observation.

In the following analysis, we need (for technical reasons due to the infinite time-horizon of our problem) to take a discount factor sufficiently large. Namely, defining

[TABLE]

with $\theta^{2}:=\frac{1}{2}\big{[}\frac{(g_{1}-g_{2})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}$ , we assume the following.

Assumption 4.2.

One has $\rho>\rho_{o}^{+}$ .

Due to the growth condition on $h$ , Assumption 4.2 in particular ensures that $\rho>\gamma\beta_{2}+\frac{1}{2}\sigma^{2}\gamma(\gamma-1)$ so that the (trivial) admissible control $\nu\equiv 0$ has a finite total expected cost.

4.1. The Related Optimal Stopping Problem

Motivated by the results of the previous sections (see in particular Theorem 3.13), we now aim at solving Problem (P3) through the study of an auxiliary optimal stopping problem. Informally, the solution to such an optimal stopping problem gives the optimal time at which the government should reduce the debt ratio by one additional unit. The optimal stopping problem involves a two-dimensional diffusive process, and in the following we provide an almost exclusively probabilistic analysis.

4.1.1. Formulation and Preliminary Results

Recall that $(I_{t},I^{1}_{t})_{t\geq 0}$ is a two-dimensional, standard $\mathbb{H}$ -Brownian motion, and introduce the two-dimensional diffusion process $(\widehat{X},\pi):=(\widehat{X}_{t},\pi_{t})_{t\geq 0}$ solving the stochastic differential equations (SDEs)

[TABLE]

with initial conditions $\widehat{X}_{0}=x$ , $\pi_{0}=y$ for any $(x,y):=(0,\infty)\times(0,1)$ . In the following, we set $\mathcal{O}:=(0,\infty)\times(0,1)$ . Recall that $\beta_{2}=r-g_{2}$ .

Since the process $\pi$ is bounded, classical results on SDEs ensure that system (4.2) admits a unique strong solution, that, when needed, we shall denote by $(\widehat{X}^{x,y},\pi^{y})$ in order to stress its dependence on the initial datum $(x,y)\in\mathcal{O}$ . In particular, one easily obtains

[TABLE]

Moreover, it can be shown that the Feller’s test of explosion (see, e.g., Chapter 5.5 in [33]) gives that $1=\mathsf{P}(\pi^{y}_{t}\in(0,1),\,\,\forall t\geq 0)$ for all $y\in(0,1)$ . In fact, the boundary points [math] and $1$ are entrance-not-exit (cf. [3], p. 15), hence unattainable for the process $\pi$ .

With regard to Remark 3.14, here we study the fully two-dimensional Markovian optimal stopping problem with value function

[TABLE]

In (4.4) the optimization is taken over all the $\mathbb{H}$ -stopping times, and the symbol $\mathsf{E}_{(x,y)}$ denotes the expectation under the probability measure $\mathsf{P}_{(x,y)}$ on $(\Omega,\mathcal{F})$ , defined as $\mathsf{P}_{(x,y)}(\,\cdot\,):=\mathsf{P}(\,\cdot\,|\widehat{X}_{0}=x,\pi_{0}=y)$ , for any $(x,y)\in\mathcal{O}$ .

Due to the fact that $\pi$ is positive, $g_{2}-g_{1}<0$ , and $\rho>\beta_{2}$ by Assumption 4.2, one has from (4.3) that

[TABLE]

which implies the convention (cf. (3.27)) $e^{-\rho\tau}\widehat{X}_{\tau}=0$ on $\{\tau=\infty\}$ .

Clearly, one has $v\geq 0$ since $\widehat{X}$ is positive and $h$ is increasing on $\mathbb{R}_{+}$ . Also, $v\leq x$ on $\mathcal{O}$ , and we can therefore define the continuation region and the stopping region as

[TABLE]

Notice that integrating by parts the term $e^{-\rho\tau}\widehat{X}_{\tau}$ , taking expectations, and exploiting that for any $\mathbb{H}$ -stopping time $\tau$ one has $\mathsf{E}[\int_{0}^{\tau}e^{-\rho s}\widehat{X}_{s}dI_{s}]=0$ (because $\rho>\beta_{2}+\frac{1}{2}\sigma^{2}$ by Assumption 4.2), we can equivalently rewrite (4.4) as

[TABLE]

for any $(x,y)\in\mathcal{O}$ . From (4.7) it is readily seen that

[TABLE]

which implies

[TABLE]

Moreover, since $\rho$ satisfies Assumption 4.2, and $0\leq\pi_{t}\leq 1$ for any $(x,y)\in\mathcal{O}$ , one has that

[TABLE]

and the family of random variables

[TABLE]

is therefore $\mathbb{H}$ -uniformly integrable under $\mathsf{P}_{(x,y)}$ .

Preliminary properties of $v$ are given in the next proposition.

Proposition 4.3.

The following hold:

(i)

$x\mapsto v(x,y)$ * is increasing for any $y\in(0,1)$ ;*

(ii)

$y\mapsto v(x,y)$ * is decreasing for any $x\in(0,\infty)$ ;*

(iii)

$(x,y)\mapsto v(x,y)$ * is continuous in $\mathcal{O}$ .*

Proof.

We prove each claim separately.

(i). Recall (4.4). By the strict convexity and the monotonicity of $h$ and (4.3), it follows that $x\mapsto\widehat{\mathcal{J}}_{(x,y)}(\tau)$ is increasing for any $\mathbb{H}$ -stopping time $\tau$ , and for any $y\in(0,1)$ . Hence the claim is proved.

(ii). This is due to the fact that $y\mapsto\widehat{\mathcal{J}}_{(x,y)}(\tau)$ is decreasing for any stopping time $\tau$ and any $x\in(0,\infty)$ . Indeed, the mapping $y\mapsto\widehat{X}^{x,y}_{t}$ is a.s. decreasing for any $t\geq 0$ (because $y\mapsto\pi^{y}_{t}$ is a.s. increasing by the comparison theorem of Yamada and Watanabe - see, e.g., Proposition 2.18 in Chapter 5.2 of [33] - and $g_{2}-g_{1}<0$ ), and $x\mapsto xh^{\prime}(x)$ is increasing.

(iii). Since $(x,y)\mapsto(\widehat{X}^{x,y}_{t},\pi^{y}_{t})$ is a.s. continuous for any $t\geq 0$ , it is not hard to verify that $(x,y)\mapsto\widehat{\mathcal{J}}_{(x,y)}(\tau)$ is continuous for any given $\tau\geq 0$ . Hence, $v$ is upper semicontinuous. We now show that it is also lower semicontinuous.

Let $(x,y)\in\mathcal{O}$ and let $(x_{n},y_{n})_{n}\subseteq\mathcal{O}$ be any sequence converging to $(x,y)$ . Without loss of generality, we may take $(x_{n},y_{n})\in(x-\delta,x+\delta)\times(y-\delta,y+\delta)$ , for a suitable $\delta>0$ . Letting $\tau^{n}_{\varepsilon}:=\tau^{n}_{\varepsilon}(x_{n},y_{n})$ be an $\varepsilon$ -optimal for $v(x_{n},y_{n})$ , but suboptimal for $v(x,y)$ , we can then write

[TABLE]

Notice now that a.s.

[TABLE]

where we have used that $x\mapsto\widehat{X}^{x,y}$ is increasing, $y\mapsto\widehat{X}^{x,y}$ is decreasing, and $x\mapsto xh^{\prime}(x)$ is positive and increasing. The random variable on the right-hand side of the latter equation is independent of $n$ and integrable due to (4.10).

Also, by an integration by parts, and performing standard estimates, we can write that a.s.

[TABLE]

and the last integral above is independent of $n$ and it has finite expectation due to (4.10).

Then, taking limits as $n\uparrow\infty$ , invoking the dominated convergence theorem thanks to the previous estimates, and using that $(x,y)\mapsto(\widehat{X}^{x,y}_{t},\pi^{y}_{t})$ is a.s. continuous for any $t\geq 0$ we find (after rearranging terms) that

[TABLE]

We thus conclude that $v$ is lower semicontinuous at $(x,y)$ by arbitrariness of $\varepsilon$ . Since $(x,y)\in\mathcal{O}$ was arbitrary as well, then $v$ is lower semicontinuous on $\mathcal{O}$ . ∎

Due to Proposition 4.3-(iii) one has that the stopping region is closed, whereas the continuation region is open. Moreover, thanks to (4.10) and the $\mathsf{P}_{(x,y)}$ -a.s. continuity of $t\mapsto\int_{0}^{t}e^{-\rho s}\widehat{X}_{s}(h^{\prime}(\widehat{X}_{s})-(\rho-\beta_{2}-(g_{2}-g_{1})\pi_{s})ds$ , we can apply Theorem D.12 in Appendix D of [34] to obtain that the first entry time of $(\widehat{X},\pi)$ into $\mathcal{S}$ is optimal for (4.4); that is,

[TABLE]

attains the infimum in (4.4) (here we adopt the usual convention $\inf\emptyset=\infty$ ).

Also, by employing standard means based on the strong Markov property of $(\widehat{X},\pi)$ (see, e.g., [40], Ch. I, Sec. 2, Thm. 2.4), one can show that, $\mathsf{P}_{(x,y)}$ -a.s., the process $S:=\big{(}S_{t}\big{)}_{t\geq 0}$ , with

[TABLE]

and that the stopped process $(S_{t\wedge\tau^{\star}}\big{)}_{t\geq 0}$ is an $\mathbb{H}$ -martingale. The latter two conditions are usually referred to as the subharmonic characterization of the value function $v$ .

We now rule out the possibility of an empty stopping region.

Lemma 4.4.

The stopping region of (4.6) is not empty.

Proof.

We argue by contradiction and we suppose that $\mathcal{S}=\emptyset$ . Hence, for any $(x,y)\in\mathcal{O}$ we can write

[TABLE]

where the inequality $xh^{\prime}(x)\geq h(x)$ , due to convexity of $h$ , and the growth condition assumed on $h$ (cf. Assumption 4.1) have been used. Now, by taking $x$ sufficiently large, we reach a contradiction since $\gamma>1$ by assumption. Hence $\mathcal{S}\neq\emptyset$ . ∎

Proposition 4.5.

For any $y\in(0,1)$ let

[TABLE]

where the convention $\inf\emptyset=+\infty$ has been used. Then

(i)

[TABLE]

(ii)

$y\mapsto d(y)$ * is increasing and left-continuous;*

(iii)

there exist $0<x_{\star}<x^{\star}<\infty$ such that for any $y\in[0,1]$

[TABLE]

Proof.

(i). To show that (4.15) holds true it suffices to show that if $(x_{1},y)\in\mathcal{S}$ , then $(x_{2},y)\in\mathcal{S}$ for any $x_{2}\geq x_{1}$ . Let $\tau^{\varepsilon}:=\tau^{\varepsilon}(x_{2},y)$ be an $\varepsilon$ -optimal stopping time for $v(x_{2},y)$ . Then, exploiting the fact that $\widehat{X}^{x_{2},y}_{t}=\frac{x_{2}}{x_{1}}\widehat{X}^{x_{1},y}_{t}\geq\widehat{X}^{x_{1},y}_{t}$ a.s. and the monotonicity of $h^{\prime}$ , we can write from (4.7)

[TABLE]

Therefore, by arbitrariness of $\varepsilon$ , we conclude that $(x_{2},y)\in\mathcal{S}$ as well, and therefore that $d$ as in (4.14) splits $\mathcal{C}$ and $\mathcal{S}$ as in (4.15).

(ii). Let $(x,y_{1})\in\mathcal{C}$ . Since $y\mapsto v(x,y)$ is decreasing by Proposition 4.3-(ii), it thus follows that $(x,y_{2})\in\mathcal{C}$ for any $y_{2}\geq y_{1}$ . This in turn implies that $y\mapsto d(y)$ is increasing. The monotonicity of $y\mapsto d(y)$ , together with the fact that $\mathcal{S}$ is closed, then give the claimed left-continuity by standard arguments.

(iii). Let $\Theta^{x}_{t}:=x\exp\big{\{}(\beta_{2}-\frac{1}{2}\sigma^{2}+(g_{2}-g_{1}))t+\sigma I_{t}\big{\}}$ , and introduce the one-dimensional optimal stopping problem

[TABLE]

Because $g_{2}-g_{1}<0$ , $h^{\prime}$ is increasing, and $\pi^{y}_{t}\leq 1$ a.s. for all $t\geq 0$ and $y\in(0,1)$ , it is not hard to see that $v(x,y)\geq v^{\star}(x)$ for any $(x,y)\in\mathcal{O}$ .

By arguments similar to those employed to prove (i) above one can show that there exists $x^{\star}$ such that $\{x\in(0,\infty):\,v^{\star}(x)\geq x\}=\{x\in(0,\infty):\,x\geq x^{\star}\}$ . In fact, by arguing as in the proof of Lemma 4.4, one has that the latter set is not empty. Then the following inclusions hold

[TABLE]

which in turn show that $d(y)\leq x^{\star}$ for all $y\in(0,1)$ . Hence, also $d(y)\leq x^{\star}$ for all $y\in[0,1]$ , by setting $d(0+):=\lim_{y\downarrow 0}d(y)$ by monotonicity, and $d(1):=\lim_{y\uparrow 0}d(y)$ by left-continuity.

As for the lower bound of $d$ , notice that (4.9) implies

[TABLE]

where $(h^{\prime})^{-1}(\,\cdot\,)$ is the inverse of the strictly increasing function $h^{\prime}:[0,\infty)\mapsto(0,\infty)$ (notice that $\rho-\beta_{2}-(g_{2}-g_{1})y\geq 0$ since $\rho>\beta_{2}$ , $g_{2}-g_{1}<0$ , and $y>0$ ). Since $(h^{\prime})^{-1}$ is strictly increasing, and $-(g_{2}-g_{1})y\geq 0$ , we can conclude from (4.18) that $d(y)\geq(h^{\prime})^{-1}\big{(}\rho-\beta_{2})$ for every $y\in[0,1]$ .

Moreover, setting $\Psi^{x}_{t}:=x\exp\{(\beta_{2}-\frac{1}{2}\sigma^{2})t+\sigma I_{t}\}$ and introducing the one-dimensional optimal stopping problem

[TABLE]

one has that $v(x,y)\leq v_{\star}(x)$ for any $(x,y)\in\mathcal{O}$ . Following arguments as those employed above, the last inequality implies that $d(y)\geq x_{\star}$ for all $y\in[0,1]$ , where $x_{\star}:=\inf\{x>0:\,v_{\star}(x)\geq x\}\in(0,\infty)$ . ∎

4.1.2. Smooth-Fit Property and Continuity of the Free Boundary

We now aim at proving further regularity of $v$ and of the free boundary $d$ .

The second-order linear elliptic differential operator

[TABLE]

acting on any function $f\in C^{2}(\mathcal{O})$ , is the infinitesimal generator of the process $(\widehat{X},\pi)$ . The nondegeneracy of the process $(\widehat{X},\pi)$ , the smoothness of the coefficients in (4.1.2), together with the subharmonic characterization of $v$ , allow to prove by standard arguments (see, e.g., [40], Ch. 3, Sec. 7.1) and classical regularity results for elliptic partial differential equations (see, e.g., [27]) the following result.

Lemma 4.6.

The value function $v$ of (4.4) belongs to $C^{2}$ separately strictly inside $\mathcal{C}$ and $\mathcal{S}$ (i.e. away from the boundary $\partial\mathcal{C}$ of $\mathcal{C}$ ). Moreover, inside $\mathcal{C}$ it uniquely solves

[TABLE]

with $\mathbb{L}$ as in (4.1.2).

We continue our analysis by proving that the value function of (4.4) belongs to $C^{1}((0,\infty)\times(0,1))$ . This will be obtained through probabilistic methods that rely on the regularity (in the sense of diffusions) of the stopping set $\mathcal{S}$ for the process $(\widehat{X},\pi)$ (see [17] where this methodology has been recently developed in a general context; for other examples refer to [16] and [31]). Recall that the boundary points are regular for $\mathcal{S}$ relative to $(\widehat{X},\pi)$ if (cf. Definition 2.9 p. 249 in [33])

[TABLE]

The time $\widehat{\tau}(x_{o},y_{o})$ is the first hitting time of $(\widehat{X}^{x_{o},y_{o}},\pi^{y_{o}})$ to $\mathcal{S}$ .

Notice that for every bounded Borel function $f:\mathbb{R}^{2}\mapsto\mathbb{R}$ one has $\mathsf{E}_{(x,y)}\big{[}f(\widehat{X}_{t},\pi_{t})\big{]}=\mathsf{E}_{(u,y)}\big{[}f(e^{U_{t}},\pi_{t})\big{]}$ , where $u:=\ln(x)$ and $U_{t}:=\ln(\widehat{X}_{t})$ is such that $dU_{t}=\big{(}\beta_{2}+(g_{2}-g_{1})\pi_{t}-\frac{1}{2}\sigma^{2}\big{)}dt+\sigma dI_{t}$ . Due the nondegeneracy of the process $(U,\pi)$ , and the smoothness and boundedness of its coefficients, we have that $(U,\pi)$ has a continuous transition density $\widehat{p}(\cdot,\cdot,\cdot;u,y)$ , $(u,y)\in\mathbb{R}\times(0,1)$ , such that for any $t\geq 0$ and $(u^{\prime},y^{\prime})\in\mathbb{R}\times(0,1)$ (see, e.g., [1])

[TABLE]

for some constants $M>m>0$ and $\Lambda>\lambda>0$ . It thus follows that $(u,y)\mapsto\mathsf{E}_{(u,y)}\big{[}f(e^{U_{t}},\pi_{t})\big{]}$ is continuous, so that $(U,\pi)$ is a strong Feller process. Hence, $(\widehat{X},\pi)$ is strong Feller as well, and we can therefore conclude that (4.22) holds true if and only if (see [18], pp. 32-40)

[TABLE]

where $\tau^{\star}$ is as in (4.12).

The next proposition shows the validity of (4.22).

Proposition 4.7.

The boundary points in $\partial\mathcal{C}$ are regular for $\mathcal{S}$ relative to $(\widehat{X},\pi)$ ; that is, (4.22) holds.

Proof.

Let $(x_{o},y_{o})\in\partial\mathcal{C}$ , and set $u_{o}:=\ln(x_{o})$ . With $U$ as defined above, we set $\widehat{\sigma}(u_{o},y_{o}):=\widehat{\tau}(e^{u_{o}},y_{o})$ , $(u_{o},y_{o})\in\mathbb{R}\times(0,1)$ , and we equivalently rewrite (4.22) in terms of the process $(U,\pi)$ as

[TABLE]

Given that $y\mapsto\ln(d(y))$ is increasing (since $y\mapsto d(y)$ is such), then the region $\widehat{\mathcal{S}}:=\{(u,y)\in\mathbb{R}\times(0,1):\,u\geq\ln(d(y))\}$ enjoys the so-called cone property (see [33], p. 250). In particular, we can always construct a cone $C_{o}$ with vertex in $(u_{o},y_{o})$ and aperture $0\leq\phi\leq\pi/2$ such that $C_{o}\cap(\mathbb{R}\times(0,1))\subseteq\widehat{\mathcal{S}}$ , and for any $t_{o}\geq 0$ we can write that

[TABLE]

Then using (4.1.2) one has

[TABLE]

where we have used that the change of variable $u^{\prime}:=(u-u_{o})/\sqrt{t_{o}}$ and $y^{\prime}:=(y-y_{o})/\sqrt{t_{o}}$ maps the cone $C_{o}$ into itself. The number $\ell$ above depends on $u_{o},y_{o}$ , but it is independent of $t_{o}$ . From (4.25) and (4.1.2) we thus have that $\mathsf{P}(\widehat{\sigma}(u_{o},y_{o})\leq t_{o})\geq\ell$ , and letting $t_{o}\downarrow 0$ we obtain $\mathsf{P}(\widehat{\sigma}(u_{o},y_{o})=0)\geq\ell>0$ . However, $\{\widehat{\sigma}(u_{o},y_{o})=0\}\in\mathcal{H}_{0}$ , and by the Blumenthal’s 0-1 Law we obtain $\mathsf{P}(\widehat{\sigma}(u_{o},y_{o})=0)=1$ , which completes the proof. ∎

Theorem 4.8.

One has that $v\in C^{1}(\mathcal{O})$ .

Proof.

The value function belongs to $C^{2}$ strictly inside the continuation region due to Lemma 4.6, and it is $C^{\infty}$ strictly inside the stopping region where $v=x$ . It thus only remains to prove that $v$ is continuously differentiable across $\partial\mathcal{C}$ . In the following, we will prove that: (i) the function $\overline{w}:=\frac{1}{x}(v-x)$ has continuous derivative with respect to $x$ across $\partial\mathcal{C}$ (and this clearly implies the continuity of $v_{x}$ across $\partial\mathcal{C}$ ); (ii) that the function $v_{y}$ is continuous across $\partial\mathcal{C}$ .

(i) Continuity of $v_{x}$ across $\partial\mathcal{C}$ . For the subsequent arguments it is useful to notice that the function $\overline{w}=\frac{1}{x}(v-x)$ admits the representation (recall (4.7))

[TABLE]

and to bear in mind that the optimal stopping time $\tau^{\star}$ for $v$ as in (4.12) is also optimal for $\overline{w}$ since $v\geq x$ if and only if $\overline{w}\geq 0$ . We now prove that $\overline{w}_{x}$ is continuous across $\partial\mathcal{C}$ , thus implying continuity of $v_{x}$ across $\partial\mathcal{C}$ .

Take $(x,y)\in\mathcal{C}$ , and let $\varepsilon>0$ be such that $x-\varepsilon>0$ . Since $x\mapsto\overline{w}(x,y)$ is increasing (due to the monotonicity of $h^{\prime}$ ) it is clear that $(x-\varepsilon,y)\in\mathcal{C}$ as well. Denote by $\tau^{\star}_{\varepsilon}(x,y):=\tau^{\star}(x-\varepsilon,y)$ the optimal stopping time for $\overline{w}(x-\varepsilon,y)$ , and notice that $\tau^{\star}_{\varepsilon}(x,y)$ is suboptimal for $\overline{w}(x,y)$ and $\tau^{\star}_{\varepsilon}(x,y)\rightarrow\tau^{\star}(x,y)$ a.s. To simplify exposition in the following we write $\tau^{\star}_{\varepsilon}:=\tau^{\star}_{\varepsilon}(x,y)$ and $\tau^{\star}:=\tau^{\star}(x,y)$ . We can then write from (4.27)

[TABLE]

for some $\xi_{\varepsilon}\in(x-\varepsilon,x)$ , and where in the last step we have used the mean value theorem, and the fact that $\widehat{X}^{x,y}_{t}-\widehat{X}^{x-\varepsilon,y}_{t}=\varepsilon\widehat{X}^{1,y}_{t}$ . Letting $\varepsilon\downarrow 0$ , invoking the dominated convergence theorem (thanks to the fact that $\rho>\big{(}\gamma\beta_{2}+\frac{1}{2}\sigma^{2}\gamma(\gamma-1)\big{)}\vee\big{(}2\beta_{2}+\sigma^{2}\big{)}$ by Assumption 4.2), and using that $\overline{w}\in C^{1}(\mathcal{C})$ (since $v\in C^{1}(\mathcal{C})$ ), we then find from the latter that

[TABLE]

Let now $(x_{o},y_{o})$ be any arbitrary point belonging to $\partial\mathcal{C}$ . Taking limits in (4.28) as $(x,y)\rightarrow(x_{o},y_{o})$ , by the dominated convergence theorem and thanks to Proposition 4.7 we obtain that

[TABLE]

thus proving that $\overline{w}_{x}$ is continuous across $\partial\mathcal{C}$ . This immediately implies the continuity of $v_{x}$ across $\partial\mathcal{C}$ , upon recalling that $v=x(\overline{w}+1)$ .

(ii) Continuity of $v_{y}$ across $\partial\mathcal{C}$ . Take again $(x,y)\in\mathcal{C}$ , and let $\varepsilon>0$ be such that $y+\varepsilon<1$ . Since $y\mapsto v(x,y)$ is decreasing (cf. Proposition 4.3-(ii)), it is clear that $(x,y+\varepsilon)\in\mathcal{C}$ as well. Denote by $\tau^{\star}_{\varepsilon}(x,y):=\tau^{\star}(x,y+\varepsilon)$ the optimal stopping time for $v(x,y+\varepsilon)$ and notice that $\tau^{\star}_{\varepsilon}(x,y)$ is suboptimal for $v(x,y)$ and $\tau^{\star}(x,y+\varepsilon)\rightarrow\tau^{\star}(x,y)$ a.s. as $\varepsilon\downarrow 0$ . In order to simplify the notation, in the following we write $\tau^{\star}_{\varepsilon}$ instead of $\tau^{\star}_{\varepsilon}(x,y)$ .

From Proposition 4.3-(ii) and (4.7) we can then write

[TABLE]

Now, add and subtract both $\mathsf{E}[\int_{0}^{\tau^{\star}_{\varepsilon}}e^{-\rho t}\widehat{X}^{x,y+\varepsilon}_{t}h^{\prime}(\widehat{X}^{x,y}_{t})dt]$ and $(g_{2}-g_{1})\mathsf{E}[\int_{0}^{\tau^{\star}_{\varepsilon}}e^{-\rho t}\widehat{X}^{x,y+\varepsilon}_{t}\pi^{y}_{t}dt]$ in the right-hand side of the latter, and recall that $(g_{2}-g_{1})<0$ , that $\widehat{X}^{x,y}_{t}\geq 0$ a.s. for every $t\geq 0$ , as well as that $(\pi^{y+\varepsilon}_{t}-\pi^{y}_{t})\geq 0$ a.s. for every $t\geq 0$ . Then, after rearranging terms and employing the integral mean value theorem (for some $L_{t}^{\varepsilon}\in(\widehat{X}^{x,y+\varepsilon}_{t},\widehat{X}^{x,y}_{t})$ a.s.), we obtain from the equation above that

[TABLE]

In the last inequality we have used that $\rho-\beta_{2}-\pi^{y}_{t}(g_{2}-g_{1})\geq 0$ , since $\rho>\beta_{2}$ by Assumption 4.2, that $g_{2}-g_{1}<0$ , and that $\widehat{X}^{x,y+\varepsilon}_{t}\leq\widehat{X}^{x,y}_{t}$ .

Define now $\Delta\pi^{y}_{t}:=\frac{1}{\varepsilon}(\pi^{y+\varepsilon}_{t}-\pi^{y}_{t})$ , $t\geq 0$ , and notice that, by using the second equation in (4.2), we can write

[TABLE]

with $\Delta\pi^{y}_{0}=1$ . With the help of Itô’s formula, it can be easily shown that

[TABLE]

with $\theta^{2}:=\frac{1}{2}\big{[}\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}$ , solves the previous stochastic differential equation.

Also, by (4.3) and simple algebra,

[TABLE]

Employing the definition of $\Delta\pi^{y}_{t}$ and (4.31) in (4.1.2), and using that $\widehat{X}^{x,y+\varepsilon}_{t}\leq\widehat{X}^{x,y}_{t}$ , one finds

[TABLE]

We now aim at taking limits as $\varepsilon\downarrow 0$ in (4.1.2). To this end, notice that $\Delta\pi^{y}_{t}\rightarrow Z^{y}_{t}$ a.s. for all $t\geq 0$ , as $\varepsilon\downarrow 0$ , where, by Theorem $39$ in Chapter V.7 of [42], $(Z^{y}_{t})_{t\geq 0}$ is the unique strong solution to

[TABLE]

with $Z_{0}^{y}=1$ . Then, if we were allowed to invoke the dominated convergence theorem when taking limits as $\varepsilon\downarrow 0$ in (4.1.2), we would obtain that

[TABLE]

upon recalling that $v\in C^{2}(\mathcal{C})$ . Therefore, letting $(x_{o},y_{o})$ be any arbitrary point belonging to $\partial\mathcal{C}$ , by taking limits in (4.1.2) as $(x,y)\rightarrow(x_{o},y_{o})$ , by the dominated convergence theorem and thanks to Proposition 4.7 we obtain that

[TABLE]

thus proving that $v_{y}$ is continuous across $\partial\mathcal{C}$ .

In order to complete the proof it thus only remains to show that the dominated convergence theorem can indeed be applied when taking limits as $\varepsilon\downarrow 0$ in (4.1.2). This is what we are going to show in the two following technical steps.

Step 1. To prove that the dominated convergence theorem can be invoked when taking $\varepsilon\downarrow 0$ in the first expectation on the right-hand side of (4.1.2), we set

[TABLE]

and we show that the family of random variables $\{\Lambda_{\varepsilon},\varepsilon\in(0,1-y)\}$ is bounded in $L^{2}(\Omega,\mathcal{F},\mathsf{P})$ , hence uniformly integrable.

Notice that by Assumption 4.1-(ii) and the fact that $L_{t}^{\varepsilon}\leq\widehat{X}^{x,y}_{t}$ a.s., one has a.s. for any $t\geq 0$

[TABLE]

for some constant $\widehat{K}>0$ (independent of $\varepsilon$ ), so that by Jensen’s inequality

[TABLE]

Then, taking expectations and applying Hölder’s inequality

[TABLE]

for some other constant $K^{\prime}>0$ , independent of $\varepsilon$ , that in the following will be varying from line to line.

The standard inequality $1-e^{-x}\leq x$ , with $x=\varepsilon(g_{1}-g_{2})\int_{0}^{t}\Delta\pi^{y}_{s}ds\geq 0$ , allows us to continue from (4.35) and write

[TABLE]

We now treat the two expectations in (4.36) separately. First of all, notice that by Jensen’s inequality

[TABLE]

Second of all, thanks to the nonnegativity of $(\Delta\pi^{y})^{4}$ , we can invoke Fubini-Tonelli’s theorem and using also (4.37), obtain

[TABLE]

We now aim at evaluating the expectation in the last integral above.

To accomplish that, notice that by applying Itô’s formula to the process $\xi^{y}_{t}:=(\Delta\pi^{y}_{t})^{4}$ , and using (4.30), we have for any $t>0$

[TABLE]

with $\xi^{y}_{0}=1$ and $\theta^{2}=\frac{1}{2}\big{[}\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}+(\alpha_{1}-\alpha_{2})^{2}\big{]}$ . Because $(1-\pi^{y+\varepsilon}_{t}-\pi^{y}_{t})^{2}\leq 2$ a.s. for all $t\geq 0$ , and

[TABLE]

where $(M^{y}_{t})_{t\geq 0}$ is an exponential martingale, it is easy to see that

[TABLE]

Using the latter estimate in (4.1.2), together with Assumption 4.2, we deduce that

[TABLE]

As for the second expectation in (4.36), Assumption 4.2 and standard estimates employing (4.3) (together with the fact that $(g_{2}-g_{1})\int_{0}^{t}\pi^{y}_{s}ds<0$ ) guarantee that it is finite. Moreover, it is independent of $\varepsilon$ . Combining this with (4.40) we thus find from (4.36) that $\sup_{\varepsilon\in(0,1-y)}\mathsf{E}\Big{[}\big{|}\Lambda_{\varepsilon}\big{|}^{2}\Big{]}^{\frac{1}{2}}<\infty,$ thus implying that the family of random variables $\{\Lambda_{\varepsilon},\varepsilon\in(0,1-y)\}$ is bounded in $L^{2}(\Omega,\mathcal{F},\mathsf{P})$ , hence uniformly integrable.

Step 2. We consider the second expectation on the right-hand side of (4.1.2), and setting

[TABLE]

we aim at proving that the family of random variables $\{\Xi_{\varepsilon},\varepsilon\in(0,1-y)\}$ is bounded in $L^{2}(\Omega,\mathcal{F},\mathsf{P})$ , hence uniformly integrable.

By Jensen’s inequality first, and Hölder’s inequality then, one finds that

[TABLE]

for some $\widehat{K}>0$ , independent of $\varepsilon$ .

The first expectation on the right-hand side of (4.41) is finite thanks to Assumption 4.2 and standard estimates employing (4.3) (together with the fact that $(g_{2}-g_{1})\int_{0}^{t}\pi^{y}_{s}ds<0$ ). Moreover, it is independent of $\varepsilon$ .

As for the second one, by interchanging expectation and time integral by Fubini-Tonelli’s theorem, and using (4.39), we obtain

[TABLE]

due to Assumption 4.2. We therefore conclude that (cf. (4.41)) $\sup_{\varepsilon\in(0,1-y)}\mathsf{E}\Big{[}\big{|}\Xi_{\epsilon}|^{2}\Big{]}^{\frac{1}{2}}<\infty$ , thus completing the proof. ∎

The previous theorem in particular implies the so-called smooth-fit property, a well known optimality principle in optimal stopping theory. Moreover, by standard arguments based on the strong Markov property of $(\widehat{X},\pi)$ (see Chapter III in [40]) it follows from the results collected so far that the couple $(v,d)$ solves the free-boundary problem

[TABLE]

with $v\in C^{2}(\mathcal{C})$ .

An important consequence of Theorem 4.8 is the following.

Proposition 4.9.

One has that $y\mapsto d(y)$ is continuous on $[0,1]$ .

Proof.

Define the probability measure $\widehat{\mathsf{P}}$ on $(\Omega,\mathcal{F})$ such that $\frac{d\widehat{\mathsf{P}}}{d\mathsf{P}}\Big{|}_{\mathcal{F}_{t}}=e^{-\frac{1}{2}\sigma^{2}t+\sigma I_{t}}$ , $t\geq 0$ . Such a measure is equivalent to $\mathsf{P}$ on $\mathcal{F}_{t}$ , and defining $\widehat{I}_{t}:=I_{t}-\sigma t$ , by Girsanov’s theorem the latter is a standard $\mathbb{H}$ -Brownian motion under $\widehat{\mathsf{P}}$ .

By a change of measure (see, e.g., Section 12 in Chapter IV of [40]) it is then not difficult to see that $v$ as in (4.7) is such that $v(x,y):=x-\widehat{V}(x,y)$ , where, for any $(x,y)\in\mathcal{O}$ , we have set

[TABLE]

with $\widehat{H}(x,y):=\Big{(}\rho-\beta_{2}-(g_{2}-g_{1})y-h^{\prime}(x)\Big{)}$ . In (4.43) above $\widehat{\mathsf{E}}_{(x,y)}$ denotes the expectation conditioned on the fact that $(\widehat{X}_{0},\pi_{0})=(x,y)$ $\widehat{\mathsf{P}}$ -a.s. Since $\{(x,y)\in\mathcal{O}:\,v(x,y)\geq x\}=\{(x,y)\in\mathcal{O}:\,\widehat{V}(x,y)\leq 0\}$ , $d(\,\cdot\,)$ is the optimal stopping boundary for the problem with value $\widehat{V}$ as well.

In order to prove the continuity of $d(\,\cdot\,)$ , we now aim at applying Theorem 10 in [41] for problem (4.43). Notice that $\widehat{V}_{x}\leq 0$ on $\mathcal{O}$ since $x\mapsto h(x)$ is strictly convex. Moreover, recalling $\theta^{2}=\frac{1}{2}[(\alpha_{1}-\alpha_{2})^{2}+\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}]$ , we have $\partial_{x}\left(\frac{\widehat{H}}{\theta^{2}y^{2}(1-y)^{2}}\right)<0$ on $\mathcal{O}$ thanks, again, to the strict convexity of $h$ . Also, $\widehat{V}_{y}$ is continuous across the boundary, due to the $C^{1}$ -property shown in Theorem 4.8 for $v=x-\widehat{V}$ ; hence, the horizontal smooth-fit property holds.

We can therefore apply Theorem 10 of [41] (upon noticing that in [41] $x$ is the horizontal axis and $y$ is the vertical one, while, in our paper, $x$ is the vertical axis and $y$ is the horizontal one), and conclude that $d$ cannot have discontinuities of the first kind at any point $y\in[0,1)$ . Finally, $d$ is also continuous at $y=1$ since it is left-continuous by Proposition 4.5-(ii). ∎

4.2. The Optimal Control for Problem (P3)

In this section, we provide the form of the optimal debt reduction policy. It is given in terms of the free boundary studied in the previous section.

For $d$ as in (4.14), introduce under $\mathsf{P}_{(x,y)}$ the nondecreasing process

[TABLE]

with $\overline{\nu}^{\star}_{0^{-}}=0$ , and then the process

[TABLE]

Notice that since $\overline{\nu}^{\star}_{t}\leq x$ a.s. for all $t\geq 0$ , and $t\mapsto\overline{\nu}^{\star}_{t}$ is nondecreasing, it does follows from (4.45) that $\nu^{\star}$ is admissible. Moreover, $t\mapsto\overline{\nu}^{\star}_{t}$ is continuous (with the exception of a possible initial jump at initial time), due to the continuity of $y\mapsto d(y)$ and to that of $t\mapsto I_{t}$ , $t\mapsto\pi_{t}$ , and $t\mapsto\int_{0}^{t}\pi_{s}ds$ .

Theorem 4.10.

Let $\widetilde{V}(x,y):=\int_{0}^{x}\frac{1}{z}v(z,y)dz$ , $(x,y)\in[0,\infty)\times[0,1]$ . Then one has $\widetilde{V}=V$ on $[0,\infty)\times[0,1]$ , and $\nu^{\star}$ as in (4.45) is optimal for Problem (P3).

Proof.

Recall $U=U_{0}$ as in (3.26), and notice that in our Markovian setting one has $\frac{1}{z}v(z,y)=U(z)$ . By the proof of Theorem 3.13 it suffices to show that the right-continuous inverse of the stopping time $\tau^{\star}(z,y)=\ \inf\{t\geq 0\ |\ \widehat{X}^{z,y}_{t}\geq d(\pi^{y}_{t})\}$ (which is optimal for $v(z,y)$ , cf. (4.12)) coincides (up to a null set) with $\overline{\nu}^{\star}$ .

Then, recall (3.34) from the proof of Theorem 3.13, fix $(x,y)\in(0,\infty)\times(0,1)$ , take $t\geq 0$ arbitrary, and notice that by (4.12) we have $\mathsf{P}_{(z,y)}$ -a.s. the equivalences

[TABLE]

Hence, $\tau^{\overline{\nu}^{\star}}_{+}(z)=\tau^{\star}(z,y)$ a.s., and $\overline{\nu}^{\star}_{\cdot}$ is the right-continuous inverse of $\tau^{\star}(\cdot,y)$ . Since $\overline{\nu}^{\star}$ is admissible, by arguing as in Step 2 of the proof of Theorem 3.13 the claim follows. ∎

Notice that the equation of $X^{x,y,\nu}$ in the formulation of Problem (P3), and (4.45), yield

[TABLE]

which, with regard to (4.44), shows that

[TABLE]

Moreover, it is easy to see that we can express $\overline{\nu}^{\star}$ of (4.44) as

[TABLE]

The previous equations allow us to make some remarks about the optimal debt management policy of our problem.

(i)

If at initial time the level of the debt ratio $x$ is above $d(y)$ , then an immediate lump sum reduction of amplitude $(x-d(y))$ is optimal.

(ii)

At any $t\geq 0$ , it is optimal to keep the debt ratio level below the belief-dependent ceiling $d$ .

(iii)

If the level of the debt ratio at time $t$ is strictly below $d(\pi_{t})$ , there is no need for interventions. The government should intervene to reduce its debt only at those (random) times $t$ at which the debt ratio attempts to rise above $d(\pi_{t})$ . These interventions are then minimal, in the sense that $(X^{x,y,\nu^{\star}},\pi^{y},\nu^{\star})$ solves a Skorokhod reflection problem at the free boundary $d$ .

(iv)

Recall that the debt ceiling $d$ is an increasing function of the government’s belief that the economy is enjoying a phase of fast growth. Then, with regard to the previous description of the optimal debt reduction rule, we have that the more the government believes that the economy is in a good shape, the more the fiscal space is, and the less strict the optimal debt reduction policy should be.

4.3. Regularity of the Value Function of Problem (P3) and Related HJB Equation

Combining the results collected so far, we are now able to prove that the value function $V$ of control Problem (P3) is a twice-continuously differentiable function. As a byproduct, $V$ is a classical solution to the corresponding Hamilton-Jacobi-Bellman (HJB) equation.

By Theorem 4.10 we know that $V(x,y)=\int_{0}^{x}\frac{1}{z}v(z,y)dz$ , for all $(x,y)\in\overline{\mathcal{O}}:=[0,\infty)\times[0,1]$ . Hence, thanks to Theorem 4.8 and to the dominated convergence theorem, we immediately obtain the following result.

Lemma 4.11.

One has that $V\in C^{1}(\mathcal{O})\cap C(\overline{\mathcal{O}})$ . Moreover, $V_{xx}\in C(\mathcal{O})$ , as well as $V_{xy}\in C(\mathcal{O})$ .

To take care of the second derivative $V_{yy}$ we follow ideas used in [14]. In particular, we determine the second weak derivative of $V$ (recall that $V_{y}$ is continuous by Theorem 4.8), and we then show that it is a continuous function. This is accomplished in the next proposition.

Proposition 4.12.

Let $\theta^{2}:=\frac{1}{2}[(\alpha_{1}-\alpha_{2})^{2}+\frac{(g_{2}-g_{1})^{2}}{\sigma^{2}}]$ . We have $V_{yy}\in C(\mathcal{O})$ with

[TABLE]

Proof.

Notice that $V_{y}(x,y)=\int_{0}^{x}\frac{1}{z}v_{y}(z,y)dz$ , and therefore $V_{y}(x,\cdot)$ is a continuous function for all $x>0$ by Theorem 4.8 (notice indeed that by the bounds in (4.1.2) and the multiplicative dependence of $\widehat{X}^{z,y}$ with respect to $z$ one has that $\frac{1}{z}v_{y}(z,y)$ is integrable at zero). Hence, its weak derivative with respect to $y$ is a function $g\in L^{1}_{loc}(\mathcal{O})$ such that for any test function $\varphi\in C^{\infty}_{c}((0,1))$ one has

[TABLE]

We now aim at evaluating $g$ and at showing that it coincides with the right-hand side of (4.12).

Denote by $m(x)$ , $x>0$ , the generalized right-continuous inverse of $d(y)$ , $y\in[0,1]$ ; that is, $m(x):=\inf\{y\in[0,1]:\,d(y)\geq x\}$ . Then, noticing that $v_{y}=0$ on $\{(x,y)\in\mathcal{O}:\,x>d(y)\}$ and using Fubini’s theorem, we can write

[TABLE]

where we have used that $v_{y}(z,m(z))=0$ for all $z\in(0,x)$ , $x>0$ , as well as $\varphi(1)=0$ .

By Lemma 4.6 (cf. also (4.1.2)), for any $y>m(z)$ , for any $z\in(0,x)$ with $x>0$ , we have that

[TABLE]

Inserting the latter expression in the last integral term on the right-hand side of (4.3), using again Fubini’s theorem and then integrating the derivatives with respect to $x$ , we find

[TABLE]

where we have also used that $h(0)=0$ . Finally, setting

[TABLE]

we see that (4.3) reads $\int_{0}^{1}V_{y}(x,y)\varphi^{\prime}(y)dy=-\int_{0}^{1}g(x,y)\varphi(y)dy$ , so that $g$ identifies with the second weak derivative of $V$ with respect to $y$ . Notice that $g$ is continuous by the continuity of $d$ , $v$ , $v_{x}$ , $h$ , and the fact that $\int_{0}^{x\wedge d(y)}\frac{1}{z}v(z,y)dz$ and $\int_{0}^{x\wedge d(y)}\frac{1}{z}v_{y}(z,y)dz$ are finite due to (4.3), (4.4), and (4.1.2). The proof is therefore completed. ∎

Thanks to Lemma 4.11 and Proposition 4.12 we have that $V\in C^{2}(\mathcal{O})\cap C(\overline{\mathcal{O}})$ . As a byproduct of this, by the Dynamic Programming Principle and standard means based on an application of Dynkin’s formula, we obtain the next result.

Proposition 4.13.

Recall the second-order differential operator $\mathbb{L}$ defined in (4.1.2). The value function $V$ of Problem (P3) is a classical solution to the HJB equation

[TABLE]

with boundary condition $V(0,y)=0$ for any $y\in[0,1]$ .

Acknowledgments

The research of Claudia Ceci is partially supported by “Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni” (GNAMPA) of “Istituto Nazionale di Alta Matematica” (INdAM). Financial support by the German Research Foundation (DFG) through the Collaborative Research Centre 1283 is gratefully acknowledged by Giorgio Ferrari. We wish to thank Luciano Campi, Tiziano De Angelis, Paola Mannucci, Fabio Paronetto, Paavo Salminen, and Wolfgang Runggaldier for useful discussions.

Appendix A Filtering Results

Proof of Proposition 3.5.

Since the innovation processes $(I,I^{1})$ (see (3.3)) and the random measure $m(dt,dq)$ (see (3.6) and (3.8)) are $\mathbb{H}$ -adapted, then $\mathbb{F}^{I}\vee\mathbb{F}^{I^{1}}\vee\mathbb{F}^{m}\subseteq\mathbb{H}$ . In general, the latter inclusion could be strict. Let us now consider the exponential $\mathbb{F}$ -martingale solving

[TABLE]

and define a probability measure $\mathbb{Q}$ on $(\Omega,\mathcal{F})$ , equivalent to $\mathsf{P}$ on $\mathcal{F}_{t}$ , and such that

[TABLE]

Notice that Assumption (3.1) ensures that $L$ is indeed an $\mathbb{F}$ -martingale. By Girsanov’s theorem, the processes

[TABLE]

are $(\mathbb{Q},\mathbb{F})$ -independent Brownian motions. We now prove that $\mathbb{F}^{\widetilde{W}}\vee\mathbb{F}^{\widetilde{B}}\vee\mathbb{F}^{m}=\mathbb{H}.$ On the one hand, the inclusion $\mathbb{F}^{\widetilde{W}}\vee\mathbb{F}^{\widetilde{B}}\vee\mathbb{F}^{m}\subseteq\mathbb{H}$ follows from the fact that ${\widetilde{W}}$ and ${\widetilde{B}}$ turn out to be $\mathbb{H}$ -adapted since they can be written as

[TABLE]

To prove the converse, let us observe that, under the probability measure $\mathbb{Q}$ , the process $X^{0}$ and $\eta$ solve the following stochastic differential equations

[TABLE]

respectively. Clearly $X^{0}$ is $\mathbb{F}^{\widetilde{W}}$ -adapted. Recalling (3.16), the solution to equation (A.4) can be constructed iteratively. More precisely, $\forall t\in[0,T_{1})$ , the process $\eta$ solves

[TABLE]

and for any time between two consecutive jump times, i.e. $t\in[T_{n},T_{n+1})$ , $n\geq 1$ , one has

[TABLE]

By Assumption 2.1, this sequence stochastic differential equations has a unique strong solution on any interval $[T_{n},T_{n+1})$ , and this in turn gives the unique strong solution $\eta$ to (A.4). Moreover, $\eta$ turns out to be $F^{\widetilde{W}}\vee\mathbb{F}^{\widetilde{B}}\vee\mathbb{F}^{m}$ -adapted.

Then, by applying Corollary III.4.3.1 in [30], we have that every $(\mathbb{Q},\mathbb{H})$ -local martingale $\widetilde{M}$ admits the decomposition

[TABLE]

where $\widetilde{\varphi}$ and $\widetilde{\psi}$ are $\mathbb{H}$ -adapted processes, and $\widetilde{w}$ is an $\mathbb{H}$ -predictable process indexed by $\mathbb{R}$ , such that for all $t\geq 0$

[TABLE]

Let now $M$ be a $(\mathsf{P},\mathbb{H})$ -local martingale, then $\widetilde{M}:=M\widetilde{L}^{-1}$ is a $(\mathbb{Q},\mathbb{H})$ -local martingale, where

[TABLE]

Taking into account (A.3), we have that $\widetilde{L}$ solves

[TABLE]

and by applying the product formula to $M=\widetilde{M}\widetilde{L}$ , we easily obtain that

[TABLE]

To conclude, we thus only need to set

[TABLE]

∎

Proof of Theorem 3.6.

In order to derive the filtering equation solved by ${\underline{\pi}}_{t}=(\pi_{t}(i);i\in S)_{t\geq 0}$ , we apply the innovation approach (see, for instance, Chapter IV in [4]). In this proof we shall use the two well-known facts:

(i)

for every $\mathbb{F}$ -martingale $m$ , the projection over $\mathbb{H}$ is an $\mathbb{H}$ -martingale; that is, $\hat{m}_{t}:=\mathsf{E}[m_{t}|\mathcal{H}_{t}]$ , $t\geq 0$ , is an $\mathbb{H}$ -martingale;

(ii)

for any $\mathbb{F}$ -progressively measurable and integrable process $\Psi$ we have that

[TABLE]

is an $\mathbb{H}$ -martingale.

The first step of the innovation method consists in writing the process $\mathds{1}_{\{Z_{t}=i\}}$ , $i\in S$ , as a semimartingale. Denoting by $L^{Z}$ the Markov generator of the state process $Z$ , we have that

[TABLE]

where $f_{k}(j)=\mathds{1}_{\{j=k\}}$ . Hence, for any $i\in S$ , we can write

[TABLE]

where $(m_{t}(i))_{t\geq 0}$ is an $\mathbb{F}$ -martingale. By taking the conditional expectation with respect to $\mathcal{H}_{t}$ , and using (i) and (ii) above, we obtain that

[TABLE]

where $M(i)$ is an $\mathbb{H}$ -martingale null at zero. Proposition 3.5 ensures the existence of processes $\psi(i)$ and $\varphi(i)$ that are $\mathbb{H}$ -predictable, and $w_{i}$ which is $\mathbb{H}$ -predictable and indexed by $\mathbb{R}$ , such that

[TABLE]

with $\mathsf{E}[\int_{0}^{t}\varphi^{2}_{s}(i)ds]<\infty$ , $\mathsf{E}[\int_{0}^{t}\psi^{2}_{s}(i)ds]<\infty$ and $\mathsf{E}[\int_{0}^{t}\int_{\mathbb{R}}|w_{i}(s,q)|m^{p,\mathbb{H}}(dt,dq)]<\infty$ , $t\geq 0$ . To obtain equation (3.14) it only remains to prove that

[TABLE]

with $w^{\pi}_{i}$ given in (3.15).

Following the same lines of the proof of Theorem 3.1 in [11] we can derive the structure of the processes $\psi(i)$ , $\varphi(i)$ by imposing the following equalities

[TABLE]

where $\widetilde{W}$ and $\widetilde{B}$ are the $\mathbb{H}$ -Brownian motions defined in (A.2). To derive the expression of $w_{i}$ , we consider a bounded process $\Gamma$ of the form $\Gamma_{t}=\int_{0}^{t}\int_{\mathbb{R}}\gamma(s,q)m(ds,dq)$ , with $\gamma$ $\mathbb{H}$ -predictable process indexed by ${\mathbb{R}}$ . Since $\Gamma$ is $\mathbb{H}$ -adapted, the equality

[TABLE]

holds. By applying the product rule (taking into account no common jumps between $Z$ and $N$ ) we obtain

[TABLE]

where $m^{p,\mathbb{F}}(dt,dq)$ is the $\mathbb{F}$ -dual predictable projection of $m(dt,dq)$ given in (3.12), and $\mathcal{M}^{\mathbb{F}}$ is an $\mathbb{F}$ -martingale. By projection onto $\mathcal{H}_{t}$ , and denoting by $\mathcal{M}^{\mathbb{H}}$ an $\mathbb{H}$ -martingale, we have that

[TABLE]

On the other hand, the product rule and (A.5) and (A.6) yield

[TABLE]

Recalling that $m^{p,\mathbb{H}}(dt,dq)$ is the $\mathbb{H}$ -dual predictable projection of $m(dt,dq)$ given in (3.11), we find

[TABLE]

where, again, $\mathcal{M}^{\mathbb{H}}$ is an $\mathbb{H}$ -martingale.

Gathering equations (A.7), (A.8), and (A.9), we obtain that for a.e. $t\geq 0$

[TABLE]

Choose now $\gamma(t,q)$ of the form $\gamma(t,q)=C_{t}\mathds{1}_{A}(q)\mathds{1}_{t\leq T_{n}}$ , with $C$ any bounded, $\mathbb{H}$ -predictable, positive process and $A\in{\mathcal{B}}(\mathbb{R})$ . Observe that $\Gamma$ is bounded since $|\Gamma_{t}|\leq\int_{0}^{t\wedge T_{n}}C_{s}dN_{s}\leq Dn$ , with $D$ a positive constant. Then the following equality holds on $\{t\leq T_{n}\}$

[TABLE]

where we have set

[TABLE]

Thus, on $\{t\leq T_{n}\}$ ,

[TABLE]

Finally, since the counting process $N$ is nonexplosive, $T_{n}\uparrow\infty$ a.s. for $n\uparrow\infty$ , and this yields (3.15). ∎

Proof of Proposition 3.10.

By Proposition 3.7, equations (3.14) and (3.21) are equivalent to a system of recursive equations between consecutive jump times, i.e. for $t\in[T_{n},T_{n+1})$ , $n=0,1,\dots$

[TABLE]

where we have set

[TABLE]

with the update at time $T_{n}$ given by

[TABLE]

Recall that, by assumption, the function $\alpha(q,i)$ given in (3.4) is locally-Lipschitz with respect to $q$ and satisfies a (global) sublinear growth condition with respect to $q\in\mathcal{I}$ , uniformly in $i\in S$ .

We now develop the proof of uniqueness by distinguishing among three different cases related to the jumps’ amplitude $c$ ; namely, $c\neq 0$ , $c=0$ , and $c\in\mathbb{R}$ .

In the case $c\neq 0$ , we have that $b^{\pi}(\underline{y},q,i):=\sum_{j=1}^{Q}\lambda_{ji}y_{j}-y_{i}\Big{[}\lambda^{N}(i)-\sum_{j=1}^{Q}\lambda^{N}(j)y_{j}\Big{]},$ and it is easy to verify that between two consecutive jump times the pair $(\underline{\pi},\eta)$ solves a $(Q+1)$ -dimensional stochastic differential equation with coefficients satisfying locally-Lipschitz conditions and (global) sublinear growth conditions with respect to $(\underline{y},q)\in\mathcal{Y}\times\mathbb{R}$ , uniformly in $i\in S$ . As a consequence, strong uniqueness holds between two consecutive jump times; i.e. for $t\in[T_{n-1},T_{n})$ , $n=1,\dots$ . Moreover, since the update at jump time $T_{n}$ (see (A.10)) depends on the process $(\underline{\pi}_{t},\eta_{t})$ for $t\in[T_{n-1},T_{n})$ , we have strong uniqueness of the solution to system (3.14) and (3.21) for all $t\geq 0$ .

In the case $c=0$ , equations (3.14) and (3.21) reduce to

[TABLE]

where, in particular, $b^{\pi}(\underline{y},q,i)=\sum_{j=1}^{Q}\lambda_{ji}y_{j}$ . It is easy to check that also in this case strong uniqueness follows by the locally-Lipschitz property of the coefficients and by their (global) sublinear growth condition.

In the case $c\in\mathbb{R}$ the jumps’ amplitude can assume any possible real value. In particular, $c$ can be such that $\eta$ and $N$ do not have only common jumps: $N$ might jump at a time at which $c(\eta_{t^{-}},Z_{t^{-}})=0$ , so that $\eta$ does not jump at that time. The treatment of this case is more delicate and should be performed separately. Indeed, the uniqueness cannot be proved by using the arguments employed in the previous two cases because of the presence of $\mathds{1}_{\{c(q,i)\neq 0\}}$ in the coefficient $b^{\pi}$ which prevents to prove Lipschitz-continuity of $b^{\pi}$ with respect to $q$ . However, one might prove uniqueness by relying on the filtered martingale problem associated to the infinitesimal generator of the triplet $(Z,X^{0},\eta)$ . We refer to the seminal paper [36] and to Theorem 3.3 and Appendix B in the more recent [11].

∎

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Aronson, D.G. (1967). Bounds for the Fundamental Solution of a Parabolic Equation. Bull. Amer. Math. Soc. 73(6) pp. 890–896.
2[2] Baldursson, F.M., Karatzas, I. ( 1997 ) 1997 (1997) . Irreversible Investment and Industry Equilibrium. Finance Stoch. 1 pp. 69–89.
3[3] Borodin, A.N., Salminen, P. (2015). Handbook of Brownian Motion - Facts and Formulae (2nd edition). Birkhäuser.
4[4] Brémaud, P. (1980). Point Processes and Queues: Martingale Dynamics . Springer-Verlag.
5[5] Cadenillas, A., Huamán-Aguilar, R. ( 2016 ) 2016 (2016) . Explicit Formula for the Optimal Government Debt Ceiling. Ann. Oper. Res. 247(2) pp. 415–449.
6[6] Cadenillas, A., Huamán-Aguilar, R. (2018). On the Failure to Reach the Optimal Government Debt Ceiling. Forthcoming on Risks .
7[7] Ceci, C., Gerardi, A. (1998). Partially Observed Control of a Markov Jump Process with Counting Observations: Equivalence with the Separated Problem. Stoch. Process. Appl. 78 , pp. 245–260.
8[8] Ceci, C., Gerardi, A. (2000). Filtering of a Markov Jump Process with Counting Observations. Appl. Math. Optim. 42(1) , pp. 1–18.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Optimal Reduction of Public Debt under Partial Observation of the Economic Growth

Abstract.

1. Introduction

2. Setting and Problem Formulation

2.1. The Setting

Assumption 2.1**.**

2.2. The Optimal Debt Reduction Problem

Remark 2.2**.**

Remark 2.3**.**

Assumption 2.4**.**

3. Reduction to an Equivalent Problem under Complete Information

3.1. The Filtering Problem

Assumption 3.1**.**

Definition 3.2**.**

Definition 3.3**.**

Proposition 3.4**.**

Proof.

Proposition 3.5**.**

Theorem 3.6**.**

Proposition 3.7**.**

Proof.

Remark 3.8**.**

3.2. The Separated Problem

Definition 3.9**.**

Proposition 3.10**.**

Proposition 3.11**.**

Remark 3.12**.**

3.3. A Probabilistic Verification Theorem via Reduction to Optimal Stopping

Theorem 3.13**.**

Proof.

Remark 3.14**.**

4. The Solution in a Case Study with Q=2Q=2Q=2 Economic Regimes

Assumption 4.1**.**

Assumption 4.2**.**

4.1. The Related Optimal Stopping Problem

4.1.1. Formulation and Preliminary Results

Proposition 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

Proposition 4.5**.**

Proof.

4.1.2. Smooth-Fit Property and Continuity of the Free Boundary

Lemma 4.6**.**

Proposition 4.7**.**

Proof.

Theorem 4.8**.**

Proof.

Proposition 4.9**.**

Proof.

4.2. The Optimal Control for Problem (P3)

Theorem 4.10**.**

Proof.

4.3. Regularity of the Value Function of Problem (P3) and Related HJB Equation

Lemma 4.11**.**

Proposition 4.12**.**

Proof.

Proposition 4.13**.**

Acknowledgments

Appendix A Filtering Results

Proof of Proposition 3.5.

Proof of Theorem 3.6.

Proof of Proposition 3.10.

Assumption 2.1.

Remark 2.2.

Remark 2.3.

Assumption 2.4.

Assumption 3.1.

Definition 3.2.

Definition 3.3.

Proposition 3.4.

Proposition 3.5.

Theorem 3.6.

Proposition 3.7.

Remark 3.8.

Definition 3.9.

Proposition 3.10.

Proposition 3.11.

Remark 3.12.

Theorem 3.13.

Remark 3.14.

4. The Solution in a Case Study with $Q=2$ Economic Regimes

Assumption 4.1.

Assumption 4.2.

Proposition 4.3.

Lemma 4.4.

Proposition 4.5.

Lemma 4.6.

Proposition 4.7.

Theorem 4.8.

Proposition 4.9.

Theorem 4.10.

Lemma 4.11.

Proposition 4.12.

Proposition 4.13.