Learning from the past in an irreversible investment problem

Topias Tolonen-Weckstr\"om

arXiv:2508.21731·math.OC·October 1, 2025

Learning from the past in an irreversible investment problem

Topias Tolonen-Weckstr\"om

PDF

Open Access

TL;DR

This paper models an irreversible investment problem where learning from past information influences the timing of investments, using a recursive stopping problem approach with explicit boundaries.

Contribution

It introduces a novel recursive framework for investment decisions involving learning from past information, with semi-explicit solutions for optimal stopping boundaries.

Findings

01

Existence of one-sided stopping boundaries at each recursion step

02

Optimal investment strategy characterized by a sequence of semi-explicit boundaries

03

Numerical solutions and comparative statistics validate the approach

Abstract

We consider an irreversible investment problem under incomplete information, where the investor decides whether and when to make investments in a project. Upon investment, the investor acquires previously hidden information from the project's past (''learning from the past''), and so the learning rate of the problem is controlled by investing. We set up this original problem as an recursively defined stopping problem, where the learning rate is accelerated after each recursion step. To solve the problem, we show that at each step, there indeed exists a one-sided stopping boundary under general conditions. We proceed to present the optimal investment strategy as a sequence of semi-explicit stopping boundaries derived from smooth fit conditions. Feasibility of our approach is then demonstrated by solving boundaries numerically and by illustrating comparative statistics.

Equations127

{d X_{t} = μ d t + σ d W_{t} Y_{t} = X_{t + δ U_{t}},

{d X_{t} = μ d t + σ d W_{t} Y_{t} = X_{t + δ U_{t}},

E [\int_{0}^{\infty} e^{- r t} μ d U_{t}],

E [\int_{0}^{\infty} e^{- r t} μ d U_{t}],

{τ_{n}}_{n = 1}^{N} sup E [n = 1 \sum N e^{- r τ_{n}} μ Δ u_{n}],

{τ_{n}}_{n = 1}^{N} sup E [n = 1 \sum N e^{- r τ_{n}} μ Δ u_{n}],

d X_{t} = μ d t + σ d W_{t},

d X_{t} = μ d t + σ d W_{t},

u_{n} = \frac{N - n}{N} for n = 0, 1, \dots, N,

u_{n} = \frac{N - n}{N} for n = 0, 1, \dots, N,

Π_{t} = P (μ = μ_{1} ∣ F_{t}^{X}) .

Π_{t} = P (μ = μ_{1} ∣ F_{t}^{X}) .

d Π_{t} = ρ Π_{t} (1 - Π_{t}) d \tilde{W}_{t},

d Π_{t} = ρ Π_{t} (1 - Π_{t}) d \tilde{W}_{t},

\tilde{W}_{t} := \frac{1}{σ} (X_{t} - \int_{0}^{t} (μ_{0} + (μ_{1} - μ_{0}) Π_{s}) d s)

\tilde{W}_{t} := \frac{1}{σ} (X_{t} - \int_{0}^{t} (μ_{0} + (μ_{1} - μ_{0}) Π_{s}) d s)

E_{π} [e^{- r τ} μ (u_{n - 1} - u_{n})] = \frac{( μ _{1} - μ _{0} )}{N} E_{π} [e^{- r τ} (Π_{τ} - k)],

E_{π} [e^{- r τ} μ (u_{n - 1} - u_{n})] = \frac{( μ _{1} - μ _{0} )}{N} E_{π} [e^{- r τ} (Π_{τ} - k)],

V_{1} (π) = τ sup E_{π} [e^{- r τ} (Π_{τ} - k)] .

V_{1} (π) = τ sup E_{π} [e^{- r τ} (Π_{τ} - k)] .

V_{2} (π) = τ sup E_{π} [e^{- r τ} (Π_{τ} - k + E_{Π_{τ}} [V_{n - 1} (Π_{ε})])],

V_{2} (π) = τ sup E_{π} [e^{- r τ} (Π_{τ} - k + E_{Π_{τ}} [V_{n - 1} (Π_{ε})])],

{V_{1} (π) = sup_{τ} E_{π} [e^{- r τ} (Π_{τ} - k)] V_{n} (π) = sup_{τ} E_{π} [e^{- r τ} g_{n} (Π_{τ})]

{V_{1} (π) = sup_{τ} E_{π} [e^{- r τ} (Π_{τ} - k)] V_{n} (π) = sup_{τ} E_{π} [e^{- r τ} g_{n} (Π_{τ})]

g_{n} (π) := π - k + F_{n - 1} (π),

g_{n} (π) := π - k + F_{n - 1} (π),

F_{n - 1} (π) := E_{π} [V_{n - 1} (Π_{ε})],

F_{n - 1} (π) := E_{π} [V_{n - 1} (Π_{ε})],

Π_{ε} = π + \int_{0}^{ε} ρ Π_{t} (1 - Π_{t}) d \tilde{W}_{t},

Π_{ε} = π + \int_{0}^{ε} ρ Π_{t} (1 - Π_{t}) d \tilde{W}_{t},

0 \leq (π - k)^{+} \leq V_{1} (π) \leq F_{1} (π) \leq (1 - k) π .

0 \leq (π - k)^{+} \leq V_{1} (π) \leq F_{1} (π) \leq (1 - k) π .

0 \leq (n - 1) (π - k)^{+} \leq F_{n - 1} (π) \leq (n - 1) (1 - k) π,

0 \leq (n - 1) (π - k)^{+} \leq F_{n - 1} (π) \leq (n - 1) (1 - k) π,

V_{1} (π) = τ sup E [e^{- r τ} (Π_{τ} - k)]

V_{1} (π) = τ sup E [e^{- r τ} (Π_{τ} - k)]

τ = in f {t \geq 0 : Π_{t} \geq b_{1}}

τ = in f {t \geq 0 : Π_{t} \geq b_{1}}

⎩ ⎨ ⎧ (L V_{1}) (π) = 0, V_{1} (π) = π - k, V_{1}^{'} (π) = 1, V_{1} (0) = 0, on π < b_{1}, at π = b_{1}, at π = b_{1},

⎩ ⎨ ⎧ (L V_{1}) (π) = 0, V_{1} (π) = π - k, V_{1}^{'} (π) = 1, V_{1} (0) = 0, on π < b_{1}, at π = b_{1}, at π = b_{1},

L := \frac{ρ ^{2} π ^{2} ( 1 - π ) ^{2}}{2} \frac{d ^{2}}{d π ^{2}} - r .

L := \frac{ρ ^{2} π ^{2} ( 1 - π ) ^{2}}{2} \frac{d ^{2}}{d π ^{2}} - r .

A_{1} (1 - π) (\frac{π}{1 - π})^{γ} + B_{1} (1 - π) (\frac{π}{1 - π})^{γ_{-}}

A_{1} (1 - π) (\frac{π}{1 - π})^{γ} + B_{1} (1 - π) (\frac{π}{1 - π})^{γ_{-}}

G (π) := (1 - π) (\frac{π}{1 - π})^{γ},

G (π) := (1 - π) (\frac{π}{1 - π})^{γ},

G (0) = 0 and (L G) (π) = 0.

G (0) = 0 and (L G) (π) = 0.

V_{1} (π) = A_{1} G (π)

V_{1} (π) = A_{1} G (π)

{A_{1} G (b_{1}) = b_{1} - k A_{1} G^{'} (b_{1}) = 1.

{A_{1} G (b_{1}) = b_{1} - k A_{1} G^{'} (b_{1}) = 1.

G^{'} (π) = \frac{γ - π}{π ( 1 - π )} G (π)

G^{'} (π) = \frac{γ - π}{π ( 1 - π )} G (π)

b_{1} = \frac{γ k}{γ + k - 1} \geq k .

b_{1} = \frac{γ k}{γ + k - 1} \geq k .

\hat{V}_{1} (π) := {π - k, \frac{b _{1} - k}{G ( b _{1} )} G (π), if π \geq b_{1}, if π < b_{1} .

\hat{V}_{1} (π) := {π - k, \frac{b _{1} - k}{G ( b _{1} )} G (π), if π \geq b_{1}, if π < b_{1} .

V_{n} (π) = τ sup E [e^{- r τ} g_{n} (Π_{τ})] = τ sup E [e^{- r τ} (Π_{τ} - k + F_{n - 1} (Π_{τ}))]

V_{n} (π) = τ sup E [e^{- r τ} g_{n} (Π_{τ})] = τ sup E [e^{- r τ} (Π_{τ} - k + F_{n - 1} (Π_{τ}))]

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications · Capital Investment and Risk Analysis · Auction Theory and Applications

Full text

Learning from the past in an irreversible investment problem

Topias Tolonen-Weckström Department of Mathematics, Uppsala University. Box 256, 75105 Uppsala, Sweden. Email address:[email protected].

(September 30, 2025)

Abstract

We consider an irreversible investment problem under incomplete information, where the investor decides whether and when to make investments in a project. Upon investment, the investor acquires previously hidden information from the project’s past (”learning from the past”), and so the learning rate of the problem is controlled by investing. We set up this original problem as an recursively defined stopping problem, where the learning rate is accelerated after each recursion step. To solve the problem, we show that at each step, there indeed exists a one-sided stopping boundary under general conditions. We proceed to present the optimal investment strategy as a sequence of semi-explicit stopping boundaries derived from smooth fit conditions. Feasibility of our approach is then demonstrated by solving boundaries numerically and by illustrating comparative statistics.

Keywords: irreversible investment, incomplete information, recursive optimal stopping, free-boundary problem, control of learning rate, project acquisition.

Mathematics Subject Classification 2020: 60G40, 93E11, 91G99.

1 Introduction

Consider a Bayesian decision-maker (investor) whose objective is to decide whether and when to make irreversible investments in a project. The decision-maker makes noisy observations of the project value under incomplete information and knows that after each investment, they will learn more about the project. We call this notion learning from the past and it is the main point of interest in our study.

We model such a problem as

[TABLE]

where the process $X_{t}$ represents the observation process with $\sigma>0$ , a standard Brownian motion $W_{t}$ , and an unknown project value $\mu$ assuming a Bernoulli distribution with two possible values $\mu_{0}<0<\mu_{1}$ . In (1), $Y_{t}$ induces the learning-from-the-past effect, $\delta$ is a positive constant denoting amount of learning per unit of investment, and $(U_{t})_{t\geq 0}$ is an increasing control process with $U_{0}=0$ and $U_{t}\leq 1$ . The objective of the investor is then to control $U_{t}$ to maximize

[TABLE]

where $r$ is a known discount rate.

Intuitively, such a problem is solved by finding a suitable stopping boundary. However, we note that there are problems in the above problem formulation as the information available to the investor depends on the control process $U_{t}$ , which in turn should depend on the available information, creating a circular feedback between the observation process and admissible controls adapted to it. Also, the continuous formulation is challenging as it leads to rather involved regularity considerations along the curved boundary. Instead of formulating this problem precisely, in order to gain mathematical tractability and to focus our efforts on the study of the learning-from-the-past effect, we choose to study a discrete version of the problem.

In particular, we restrict the possible investment levels to only attain discrete values so that $U_{t}\in\{u_{0},u_{1},\ldots,u_{N}\}$ with $\{u_{n}\}_{n=0}^{N}$ decreasing, $u_{N}=0$ , and $u_{0}=1$ . Here the index $n$ in $u_{n}$ indicates that there are $n$ remaining investment possibilities, so $u_{N}$ is the initial level of investment and $u_{0}$ is the investment level after the last possible investment. Under such restriction, the control problem described in (1)–(2) collapses into a stopping problem. The investor seeks a sequence of investment times $\{\tau_{n}\}_{n=1}^{N}$ in order to optimize

[TABLE]

where $\Delta u_{n}=u_{n-1}-u_{n}$ . We set up this problem properly in Section 2.

Learning-from-the-past effect arises naturally in applications of irreversible investment problems. In particular, it is a natural model for cases of project acquisition, where the investor, upon acquisition, learns about the project’s (for example, a company) hidden intangible assets not accounted for in its public financial statements. When accessing the project as an insider, the investor gains insider information about such assets, which can include previously unaccounted and publicly hidden goodwill, intellectual property, human economic value (human resources), and organizational culture.

Our primary contribution is to set up and study an original problem of this type, deriving semi-explicit solutions for the optimal stopping problem. In our recursive problem formulation under incomplete information and additional learning, we show that at each investment step where the future values of subsequent investments are enveloped, there exists a one sided stopping boundary. To make the standard methods of optimal stopping go through, we provide a careful analysis of properties of different payoff functionals. We show that the boundaries characterizing the optimal investment times exist and are well-defined, as well as provide equations from which the boundaries can be solved numerically. Moreover, we demonstrate that the solution concept is feasible by providing numerical examples and comparative statistics.

1.1 Related literature

Our model of learning-from-the-past is original to our article. The model provides a new way of controlling learning rate in a optimal stopping problem in an irreversible investment setting under incomplete information.

Investment and utility maximization problems under incomplete information are well studied in the field of stochastic control. An early contemporary reference that combines stochastic control with incomplete information is [16]. A key study in early investment problems within the field is [6], where an investment timing problem under incomplete information with respect to an option payoff functional is studied. General investment timing problems are examined, for example, in [4], [5], and [18], while [11] examines an irreversible investment problem by characterizing the free boundary as the unique solution of an integral equation. Investment problems with a Bayesian setting under incomplete information are discussed for example in [14] and [24]. In particular, [24] discusses the relationship between belief of a favorable market and investment timing, closely resembling our set-up. In a recent effort, [12] studies an irreversible investment problem under incomplete information, where the investment is modeled as a geometric Brownian motion.

Main point of interest in our model, rarely discussed in optimal control research, is that the decision-maker controls the learning rate. Such stochastic control problems have been discussed only recently. Statistical problems of this type are considered in [3] with a problem of quickest detection with reversible controls, [7] with an estimation problem with costly observations, and [8] with a detection problem with irreversible controls and a linear cost to increase observation rate. [10] incorporates a irreversible investment problem, where an investment directly affects the drift coefficient of the observation process.

We construct a recursively defined stopping problem but initially motivate our model as a multiple stopping problem. Connection between the two is discussed in [1] under relatively general assumptions. In addition, [2] discusses multiple optimal stopping for American swing options, largely resembling our set-up.

The set-up in [10] also models a learning feature in an irreversible investment problem. In their set-up, they consider an example of project expansion. Upon investing, the investor begins to learn at an accelerated rate due to gaining more capacities of observing the development of the market, product testing, and realized demand or production costs, for example by setting up a new production unit. There is a crucial difference between their learning and our learning-from-the-past effect. In our model, the key application considers learning by acquiring already established units. Moreover, opposed to their set-up, the investor can’t directly affect the diffusion coefficient of the output process directly by increasing their investment level. Instead, the additional learning is modeled by speeding up the observation process upon investment. Realization of this accelerated process then reveals previously hidden information, inducing the learning-form-the-past effect. Despite these differences, both of the models study problems of irreversible investment under incomplete information, where the amount of learning is both controlled and monotone upon investment: investing more yields more information to the investor.

1.2 Structure of the article

The remainder of this article is organized as follows. In Section 2 we set up the model and define the key concepts we use to build our model. More specifically, we introduce a recursive stopping problem and introduce the learning-from-the-past effect as an accelerated learning rate, which the investor uses to evaluate evaluates the value of subsequent steps in the recursion. In Section 3, a candidate solution is characterized in terms of an optimal investment strategy and the corresponding stopping boundaries. In Section 4, we show that each investment step induces a one-sided stopping boundary. Main results are found in Section 5, where verification result for each step is provided together with a main theorem which shows that individual verification results go through when combined recursively, resulting in an optimal investment strategy for our problem. Finally, we illustrate our theoretical results with key numerical examples in Section 6.

2 Problem set-up

Let $(X_{t})_{t\geq 0}$ be a diffusion process with dynamics

[TABLE]

where $\sigma>0$ is a known constant and $W_{t}$ is a standard Brownian motion defined on a probability space $(\Omega,\mathcal{F},\mathbb{P})$ .

We consider an investor who is facing an optimization problem of investing to a project. The project value $\mu$ takes possible values $\mu_{0}$ and $\mu_{1}$ with $\mu_{0}<0<\mu_{1}$ . We model incomplete information, i.e. the lack of the investor’s information on $\mu$ , by letting the investor only observe realizations of $X_{t}$ . Let $\mathcal{F}^{X}_{t}$ be the completion of the $\sigma$ -algebra $\sigma\{X_{s}:0\leq s\leq t\}$ . Then, based on $\mathcal{F}^{X}_{t}$ , the investor is interested in determining optimal investment times to maximize their value based on the unknown project value $\mu$ . We assume the admissible investment levels to be of the form

[TABLE]

and that each upon each investment, the investment level is raised from $u_{n}$ to $u_{n-1}$ . That is, the possible levels of investment $\{u_{n}\}_{n=0}^{N}$ is a decreasing sequence with $u_{N}=0$ and $u_{0}=1$ .

To characterize the investor’s learning about $\mu$ by observing realizations of $X_{t}$ , we define the belief process of the decision-maker as the conditional probability

[TABLE]

From standard literature on filter theory, see for example [17], we find that the dynamics of the process $\Pi_{t}$ can be described as

[TABLE]

where

[TABLE]

is the so-called innovations process (an $\mathcal{F}_{t}^{X}-$ Brownian motion), $\rho:=\frac{\mu_{1}-\mu_{0}}{\sigma}$ is the signal-to-noise ratio, and $\Pi_{0}=\pi\in(0,1)$ is a known constant representing the investor’s prior information that $\mu=\mu_{1}$ (i.e. the probability $\mathbb{P}(\mu=\mu_{1})$ ).

It is well-known that $\Pi_{t}$ is a strong Markov process as it solves (7) (see, for example, [19]), and so we may embed the problem into a Markovian setting, and additionally optimization over $\mathcal{F}_{t}^{X}-$ stopping times coincides with optimization over $\mathcal{F}^{\Pi}_{t}-$ stopping times ( $\mathcal{F}^{\Pi}_{t}$ being the completion of $\sigma\{\Pi_{s}:0\leq s\leq t\}$ ).

Let $N$ be fixed so that $u_{n-1}-u_{n}=\frac{1}{N}$ for all $n=1,\ldots,N$ . Then, for any $F^{X}_{t}-$ stopping time $\tau$ , it follows from the tower property of conditional expectation and the Markov property of $\Pi_{t}$ that

[TABLE]

where $k=\frac{-\mu_{0}}{\mu_{1}-\mu_{0}}$ . (Observe that optimizing over the left-hand side and the right-hand side of (9) coincide).

Now consider the stopping problem (3) presented in Section 1. Upon the last possible investment, the investor wants to find an $\mathcal{F}_{t}^{\Pi}-$ stopping time $\tau$ to solve

[TABLE]

Then, upon the previous investment, it is intuitively clear (see [1] for a general reduction of a multiple stopping problem) that the investor optimizes over a discounted payoff $e^{-r\tau}(\Pi_{\tau}-k)$ together with an expected value of the remaining investment. That is, the investor solves

[TABLE]

where $\Pi_{\varepsilon}$ denotes the additional $\varepsilon$ units of observing the process $\Pi_{t}$ .

These steps can be propagated up to $N$ steps, and so solving for (3) reduces into solving a recursively defined stopping problem

[TABLE]

for $n=2,\ldots,N$ , where

[TABLE]

and $\tau$ is an $\mathcal{F}_{t}^{\Pi}-$ stopping time.

In Sections 3–5, we treat the problem (10).

Remark 2.1.

In (12), $F_{n-1}(\pi)$ denotes a conditional expectation of $V_{n-1}$ evaluated over a strong Markov process $\Pi_{t}$ starting from the value $\pi$ and diffusing for $\varepsilon$ units ( $\varepsilon$ is analogous to $\delta$ in (1) by letting $\varepsilon=\frac{\delta}{N}$ .). The conditional expectation $\mathbb{E}_{\pi}[V_{n-1}]$ is a function of $\pi$ and it denotes an expectation of the value function $V_{n-1}$ over the diffusion process

[TABLE]

for some starting point $\pi$ , and so it models the learning-from-the-past effect as delayed information after stopping (see, for example, [20] for treatment of an optimal stopping problem with delayed information).

3 Finding a candidate solution

We first have the following result.

Lemma 3.1.

Let $g_{n}$ , $V_{n}$ and $F_{n}$ be as in (10)–(12). Then, for all $n=1,\ldots,N$ , the following hold:

(i)

$g_{n}$ , $V_{n}$ , and $F_{n}$ are convex functions, 2. (ii)

$n(\pi-k)^{+}\leq\max\{0,g_{n}(\pi)\}\leq V_{n}(\pi)\leq F_{n}(\pi)\leq n(1-k)\pi$ .

Proof.

We note that $g_{1}(\pi):=\pi-k$ is convex. By arguments for preservation of convexity for martingale diffusion processes in [15], an expected value $\mathbb{E}_{\pi}[g_{1}(\Pi_{t})]$ is convex in $\pi$ for every fixed time-point $t$ provided that $g$ is a convex function. Moreover, by a Bermudan approximation argument (see [9]), preservation of convexity extends to the corresponding stopping problem, so $V_{1}(\pi)$ is convex. Then, by Jensen’s inequality, $F_{1}\geq V_{1}$ , and clearly $V_{1}\geq\pi-k$ . Convexity of $F_{1}$ follows from convexity of $V_{1}$ .

Next, assume that $g_{n-1},V_{n-1}$ , and $F_{n-1}$ are convex. It follows that $g_{n}=\pi-k+F_{n-1}$ is also convex, and repeating the preservation of convexity and Bermudan approximation arguments yields that $V_{n}$ is convex, and so the convexity of $F_{n}$ follows. It follows that Jensen’s inequality asserts $F_{n}\geq V_{n}$ . That is, by induction, we have that $g_{n}$ , $V_{n}$ , and $F_{n}$ are convex for all $n$ , and $g_{n}\leq V_{n}\leq F_{n}$ .

Moreover, since $(\pi-k)^{+}=\max\{g_{1}(\pi),0\}\leq(1-k)\pi$ , we have

[TABLE]

Assuming that

[TABLE]

one sees that $n(\pi-k)^{+}\leq\max\{0,g_{n}(\pi)\}$ and $g_{n}=\pi-k+F_{n-1}(\pi)\leq n(1-k)\pi$ , and then $V_{n}\leq F_{n}\leq n(1-k)\pi$ . The second statement thus follows by induction. ∎

Remark 3.2.

Note that it follows from the bounds presented in Lemma 3.1 (ii) that $V_{n}(0+)=F_{n}(0+)=0$ and $V_{n}(1-)=F_{n}(1-)=n(1-k)$ . Moreover, Lemma 3.1 implies that the first derivative of $V_{n}$ is bounded.

For an illustration of the relationship between $V_{1}$ and $F_{1}$ presented in Lemma 3.1, see Figure 1.

Remark 3.3.

The function $F_{1}(\pi)$ in Figure 1 is produced numerically. For a short discussion on numerical methods used in this article, see Remark 6.1.

Case $n=1$

Consider

[TABLE]

as given in equation (10). Since this is a value function of a call option type, we expect the optimal strategy to be given by a stopping time

[TABLE]

for some boundary $b_{1}$ . By standard methods in optimal stopping theory and dynamic programming (see, for example, [23]), one expects $V_{1}$ to solve the corresponding free-boundary problem:

[TABLE]

where the differential operator $\mathcal{L}$ is given by

[TABLE]

The general solution to the ODE in (13) is of type

[TABLE]

for constants $A_{1}$ and $B_{1}$ , where $\gamma$ and $\gamma_{-}$ are the positive and the negative solutions to the quadratic equation $\gamma^{2}-\gamma-\frac{2r}{\rho^{2}}=0$ . From the boundary condition at $\pi=0$ we see that $B_{1}\equiv 0$ . We denote

[TABLE]

for which

[TABLE]

Using this notation, the value function assumes the form

[TABLE]

in the continuation region. Plugging in the boundary conditions at $b_{1}$ yields

[TABLE]

By noting that

[TABLE]

one uses the smooth fit equations (17) to derive

[TABLE]

This corresponds to the candidate value function assuming the form

[TABLE]

Verifying that $\hat{V}_{1}\equiv V_{1}$ is straightforward, see Proposition 5.1 in Section 5.

Case $1<n\leq N$

Now, for a general $n=1,\ldots,N$ , consider

[TABLE]

as given in (10). Similarly as above, for each $n$ , we expect the optimal stopping time to be of the form

[TABLE]

for some boundary $b_{n}$ . In particular, we expect the value function $V_{n}$ to solve the free boundary problem

[TABLE]

As in the case $n=1$ above, the smooth-fit guess gives us

[TABLE]

for a constant $A_{n}$ and $G(\pi)$ as in (15). This yields an equation

[TABLE]

If the smooth fit equation (22) admits a unique solution $b_{n}$ , we then define a candidate value function as

[TABLE]

The relationship between $F_{n}$ , $V_{n}$ , $g_{n}$ , and $F_{n-1}$ in the case $n=3$ is illustrated in Figure 2. From the figure it can also be seen how the value functions $V_{n}$ coincide with respective payoff functions $g_{n}$ at the respective boundary points $b_{n}$ for $n=1,2,3$ .

4 Study of the smooth fit equations

We proceed by studying the solvability of equation (22). In order to do so, we first need some technical results.

Lemma 4.1.

Let $f(\pi):=\mathbb{E}_{\pi}[v(\Pi_{\varepsilon})]$ for some $C^{2}((0,1))$ function $v$ that has a bounded first derivative.

[TABLE]

and

[TABLE]

Proof.

For the first claim, let $\tilde{f}(t,\pi)=\mathbb{E}_{\pi}\left[e^{-rt}v(\Pi_{t})\right]$ so that $f(\pi)=e^{r\varepsilon}\tilde{f}(\varepsilon,\pi)$ . Since $\mathcal{L}v\leq 0$ and the first derivative of $v$ is bounded, $e^{-rt}v(\Pi_{t})$ is a supermartingale, and $\tilde{f}(t,\pi)$ is decreasing in $t$ . Therefore, $\left(\mathcal{L}\tilde{f}\right)(t,\pi)=\tilde{f}_{t}(t,\pi)\leq 0$ , and consequently

[TABLE]

For the second claim, by Itô’s formula we get

[TABLE]

Differentiating $\tilde{f}$ with respect to $t$ yields

[TABLE]

where the notation $\Pi^{\pi}_{t}$ is used to indicate that the starting point of $\Pi$ is $\pi$ . Using a non-crossing property of paths (see [22, Chapter IX.3]) and monotonicity of $\mathcal{L}v$ , we find that $\tilde{f}_{t\pi}(t,\pi)\leq 0$ . Consequently

[TABLE]

that is, $\left(\mathcal{L}f\right)(\pi)$ is decreasing in $\pi$ . ∎

Remark 4.2.

In the following, we apply Lemma 4.1 to the function $v=V_{n}$ which is not in $C^{2}$ but merely in $C^{1}((0,1))\cap C^{2}((0,b_{n})\cup(b_{n},1))$ . However, a closer inspection of the proof of the lemma shows that the conclusion also holds in this case.

For the remainder of Section 4, we work under the following assumption.

Assumption 4.3.

For a given $n\geq 2$ , we assume that $\mathcal{L}V_{n-1}(\pi)\leq 0$ and $\mathcal{L}V_{n-1}(\pi)$ is decreasing.

That is, we proceed to show that under Assumption 4.3, the stopping problem with $n$ remaining investments with a one-sided boundary determined by smooth fit is solved. We then use induction to show that the recursive stopping problem is indeed solved for all $n=1,\ldots,N$ .

Lemma 4.1 implies the following result, for which recall that

[TABLE]

Proposition 4.4.

Assume that Assumption 4.3 holds. Then $\left(\mathcal{L}g_{n}\right)(\pi)$ is strictly decreasing. Moreover, there exist unique solutions $\pi_{n}^{0}$ and $\pi_{n}^{*}$ of $g_{n}(\pi^{0}_{n})=0$ and $\left(\mathcal{L}g_{n}\right)(\pi_{n}^{*})=0$ , respectively. Furthermore, $\pi_{n}^{*}\in(0,k]$ and $\pi_{n}^{0}\leq\pi_{n}^{*}\leq k$ .

Proof.

By Lemma 4.1, if $\left(\mathcal{L}V_{n-1}\right)(\pi)$ is decreasing then $(\mathcal{L}F_{n-1})(\pi)$ is decreasing in $\pi$ , and thus

[TABLE]

is strictly decreasing. In addition, if $g_{n}(\pi)<0$ , we have

[TABLE]

Moreover, by (27) we have that, at $\pi=k$ ,

[TABLE]

That is, $(\mathcal{L}g_{n})(\pi)$ is strictly decreasing, and it satisfies $(\mathcal{L}g_{n})(\pi)>0$ for small $\pi$ . Moreover, at $k$ it is non-positive, which shows that a unique solution $\pi_{n}^{*}$ to $\left(\mathcal{L}g_{n}\right)(\pi^{*}_{n})=0$ exists, and also that $\pi_{n}^{*}\in(0,k]$ .

To show the remaining claim, it suffices to note that

[TABLE]

so $\pi_{n}^{0}\leq\pi_{n}^{*}$ . ∎

To show that the equation (22) indeed has a unique solution, we define

[TABLE]

for $\pi\in(0,1)$ . Recall that we expect that the boundary solving the $n$ th free-boundary problem (21) is a solution to the equation $h_{n}(\pi)=0$ .

Proposition 4.5.

Assume that Assumption 4.3 holds. Then, there exists a unique solution $b_{n}$ to the equation $h_{n}(b_{n})=0$ . Moreover, $b_{n}\in(\pi^{*}_{n},b_{1}]$ .

Proof.

Since $G(\pi)g^{\prime}_{n}(\pi)\geq 0$ for all $\pi\in(0,b_{1})$ , we have $h_{n}(\pi)<0$ for $\pi\in(0,\pi^{0}_{n})$ , where $\pi^{0}_{n}$ is the solution to $g_{n}(\pi^{0}_{n})=0$ .

Using $(\mathcal{L}G)(\pi)=0$ , we have

[TABLE]

where the last inequality pair comes directly from Proposition 4.4. The sign of

[TABLE]

coincides with the sign of $h_{n}^{\prime}(\pi)$ . Consequently, $h_{n}(\pi)$ is decreasing on $(0,\pi_{n}^{*})$ and increasing on $(\pi_{n}^{*},1)$ , so there exists at most one solution of $h_{n}(b_{n})=0$ , and for such a solution we must have $b_{n}>\pi^{*}_{n}$ . We next show that $h_{n}(b_{1})\geq 0$ , which then finishes the proof.

To see that $h_{n}(b_{1})\geq 0$ , select a constant $D$ such that $DG(b_{1})-F_{n-1}(b_{1})=0$ . Lemma 4.1, together with properties of $F_{n-1}$ and $G$ (see Lemma 3.1 and equation (16), respectively), yields

[TABLE]

By the maximum principle, it follows that $DG(\pi)-F_{n-1}(\pi)\leq 0$ for $\pi\in(0,b_{1})$ , so

[TABLE]

Consequently, $G^{\prime}(b_{1})F_{n-1}(b_{1})-G(b_{1})F_{n-1}^{\prime}(b_{1})\geq 0$ , so

[TABLE]

where the last equality comes from noting that $G^{\prime}(\pi)(\pi-k)-G(\pi)=0$ is solved by $\pi=b_{1}$ by (19). This completes the proof. ∎

5 Main results

We start by verifying the candidate value function $\hat{V}_{1}(\pi)$ .

Proposition 5.1.

Let $V_{1}(\pi)$ be as in (10) and $\hat{V}_{1}(\pi)$ as in (20). Then $V_{1}(\pi)\equiv\hat{V}_{1}(\pi)$ for all $\pi\in(0,1)$ .

Proof.

When $\pi\geq b_{1}$ , we have $\hat{V}_{1}(\pi)\geq\pi-k$ directly by (20). By the convexity of $\hat{V}_{1}(\pi)$ , it follows that

[TABLE]

also for $\pi<b_{1}$ . Since $b_{1}\geq k$ , when $\pi>b_{1}$ we have

[TABLE]

Similarly, if $\pi<b_{1}$ , by (13), $\left(\mathcal{L}\hat{V}_{1}\right)(\pi)\leq 0$ holds. Therefore, by a standard verification argument (see, for example, [21] for theory and several examples), we indeed have $V_{1}(\pi)\equiv\hat{V}_{1}(\pi)$ . ∎

Next, we provide a verification result for $\hat{V}_{n}(\pi)$ for an $n=2,\ldots,N$ .

Proposition 5.2.

Assume that Assumption 4.3 holds. Let $V_{n}(\pi)$ be as in (10) and $\hat{V}_{n}(\pi)$ as in (23). Then $V_{n}(\pi)\equiv\hat{V}_{n}(\pi)$ for all $\pi\in(0,1)$ .

Proof.

Similarly as in the proof of Proposition 5.1, for the verification argument we need

[TABLE]

and

[TABLE]

For the condition (29), we note that for $\pi<b_{n}$ we automatically have

[TABLE]

On the other hand, when $\pi>b_{n}$ , we have

[TABLE]

since $b_{n}\geq\pi_{n}^{*}$ (see Propositions 4.4 and 4.5).

For the condition (30) we note that if $\pi>b_{n}$ , then by construction we have $\hat{V}_{n}(\pi)=g_{n}(\pi)$ . For $\pi\leq b_{n}$ we argue as follows.

First, we claim that $\hat{V}_{n}>g_{n}$ on $[\pi^{*}_{n},b_{n})$ . In fact, if this was not the case, then there exists $a\in[\pi^{*}_{n},b_{n})$ with $\hat{V}_{n}(a)\leq g_{n}(a)$ . Since $\mathcal{L}g_{n}\leq\mathcal{L}\hat{V}_{n}=0$ on $[a,b_{n}]$ , the maximum principle yields $V_{n}\leq g_{n}$ on $[a,b_{n}]$ . On the other hand, $(\mathcal{L}g_{n})(b_{n})<0=(\mathcal{L}\hat{V}_{n})(b_{n})$ , which implies that $\hat{V}_{n}>g_{n}$ in a left neighborhood of $b_{n}$ , which is a contradiction. It follows that $\hat{V}_{n}>g_{n}$ on $[\pi^{*}_{n},b_{n})$ .

Second, we show that $\hat{V}_{n}\geq g_{n}$ on $(0,\pi^{*}_{n})$ .

Since $\hat{V}_{n}(0)\geq g_{n}(0)$ , $\hat{V}_{n}(\pi^{*}_{n})>g_{n}(\pi^{*}_{n})$ and $\mathcal{L}g_{n}\geq\mathcal{L}\hat{V}_{n}=0$ on $(0,\pi^{*}_{n})$ , the maximum principle gives $\hat{V}_{n}\geq g_{n}$ also on $(0,\pi^{*}_{n})$ .

Since conditions (29) and (30) hold, by standard verification arguments (see note in the proof of Proposition 5.1) we have $\hat{V}_{n}(\pi)\equiv V_{n}(\pi)$ . ∎

To combine the individual verification results 5.1 and 5.2 for our main result, we need the following simple proposition.

Proposition 5.3.

Assume that Assumption 4.3 holds. Then, also $(\mathcal{L}V_{n})(\pi)$ is decreasing.

Proof.

By the construction of the candidate value function in (23) and Proposition 5.2, we have $(\mathcal{L}V_{n})(\pi)=0$ for $\pi<b_{n}$ , and $(\mathcal{L}V_{n})(\pi)=(\mathcal{L}g_{n})(\pi)$ for $\pi>b_{n}$ . Therefore, by Proposition 4.4, $(\mathcal{L}V_{n})(\pi)$ is decreasing. ∎

Using an induction argument, the following theorem contains the main result of our article.

Theorem 5.4.

For $1<n\leq N$ , let $V_{1}(\pi)$ and $V_{n}(\pi)$ be as given in equation (10), and $\hat{V}_{1}(\pi)$ and $\hat{V}_{n}(\pi)$ be as given in equations (20) and (23), respectively. Then

[TABLE]

for all $\pi\in(0,1)$ and for all $n=2,\ldots,N$ . Moreover, for each $n$ , the optimal investment strategy is to invest at random times

[TABLE]

characterized by a sequence of stopping boundaries $\{b_{n}\}_{n=1,\ldots,N}$ , where $b_{1}$ is given by the equation (19), and for each $n=2,\ldots,N$ , the boundary $b_{n}$ is the unique solution of $h_{n}(b_{n})=0$ , where $h_{n}$ is given in equation (28).

Proof.

Claim $\hat{V}_{1}(\pi)\equiv V_{1}(\pi)$ comes directly from Proposition 5.1. It follows that

[TABLE]

is decreasing and so by Proposition 5.3, $(\mathcal{L}V_{2})(\pi)$ is also decreasing. Then, for any given $n>2$ , assume that $(\mathcal{L}V_{n-1})(\pi)$ is decreasing. This assumption together with Proposition 5.3 implies that also $(\mathcal{L}V_{n})(\pi)$ is decreasing. That is, by induction $(\mathcal{L}V_{n})(\pi)$ is decreasing for all $n$ . In addition, Proposition 5.2 then asserts that $\hat{V}_{n}(\pi)\equiv V_{n}(\pi)$ for all $n$ with $n=2,\ldots,N$ . The optimality of $\tau_{n}$ for all $n=1,\ldots,N$ follows by construction.

∎

An example of the optimal strategy characterized as boundaries $\{b_{n}\}_{n=1}^{N}$ is illustrated in the following figure.

We finish the study of the learning-from-the-past effect by discussing some properties of boundaries $\{b_{n}\}_{n=1}^{N}$ with numerical methods.

6 Comparative statistics

We conduct a numerical study on the behavior of boundary $\{b_{n}\}_{n=1}^{N}$ with respect to changes in the model parameters. To highlight some of the results, we show that a case with $b_{n}<k$ is possible (see Figure 4), which shows that it may be optimal to invest in a project with negative expected value (similar observations have been made in [10]). Similarly, we show that $b_{n}$ is not monotone with respect to $\mu_{1}$ (see Figure 7), demonstrating a mixed effect between $k$ and signal-to-noise ratio $\rho$ . We then conclude with an observation that increasing $N$ (and so decreasing learning per individual investment) decreases $b_{n}$ (see Figure 8).

Remark 6.1.

Recall that boundary $b_{n}$ is given by solving (22). To solve for the boundaries, functions $F_{n}$ are solved using a finite differences method. We consider a second-order partial differential equation

[TABLE]

with $\tilde{F}_{n}(t,\pi):=E_{\pi}\left[V_{n}(\Pi_{t})\right]$ . In particular, $\tilde{F}_{n}(\varepsilon,\pi)\equiv F_{n}(\pi)$ . Moreover, we establish boundary values of $\tilde{F}_{n}(t,\pi)$ at $\pi=0$ and $\pi=1$ using the results in Lemma 3.1. The boundaries $b_{n}(\pi)$ are solved on a discrete $\pi$ grid but the values plotted and used in recursion are taken as weighted averages between the grid points. In all figures, the numerically solved boundaries $b_{n}$ are produced using the same set of parameters unless otherwise mentioned. Boundaries $b_{1}$ are solved numerically and they are verified with the explicit values given in (19).

In Figure 4, we plot the boundaries for two different levels of $N\varepsilon$ . Note how the boundary $b_{1}$ is not dependent on the chosen level of maximum learning, and for some $n$ , $b_{n}<k$ for a higher amount of total learning. This suggests a tradeoff between learning and earning, a prevalent topic in many studies of incomplete information (see, for example [13] for a classical study under a Bayesian setting).

Next, we compare the boundaries for different levels of $\sigma$ (the diffusion coefficient of the observation process (4)), which affects dynamics of the process $\Pi_{t}$ (described in (7)) via the signal-to-noise ratio $\rho=\frac{\mu_{1}-\mu_{0}}{\sigma}$ . It is expected that the boundary $b_{n}$ is increasing with respect to $\rho$ and so decreasing with respect to $\sigma$ : A higher signal-to-noise ratio implies that the investor learns more only by observing $X_{t}$ , whereas a lower signal-to-noise ratio makes the investor more eager to invest to attain additional learning. This phenomenon is confirmed in Figure 5.

Similar comparison can be done for the discount rate $r$ . Intuitively, high discount rate penalizes waiting which is tied to earlier investment times. This is confirmed in Figure 6, where we compare boundaries for different levels of $r$ .

It is also noteworthy to comparie boundaries for different project values $\mu_{0}$ and $\mu_{1}$ . Recall that in Section 2 we motivated the solution concept to our problem by defining

[TABLE]

that is, comparing different project values $\mu_{0}$ and $\mu_{1}$ reduces to comparing different values for $k$ and $\rho$ . We expect that increasing $k$ pushes up the boundary, and with Figure 5 we argued that the boundary $b_{n}$ is also monotone with respect to $\rho$ . Moreover, both $k$ and $\rho$ are monotone with respect to $\mu_{0}$ . However, $k$ and $\rho$ have different monotonities in $\mu_{1}$ , leading to a mixed effect. This is demonstrated in Figure 7, where there is no monotonicity of $b_{n}$ with respect to $\mu_{1}$ .

Increasing the number of possible investments $N$ should also affect the boundary. Such comparison is possible with fixing $N\varepsilon$ , i.e. to scale inversely the amount of additional learning $\varepsilon$ with $N$ . One expects increasing $N$ to decrease the boundary $b_{n}$ , reducing the effect of an individual investment. This is confirmed in Figure 8.

We finish with the following remark on possible further extensions to our research.

Remark 6.2.

We note that solving for the multiple stopping problem (10) indeed is a discrete analogue of solving the continuous problem (1)–(2). By taking the limit $N\to\infty$ and by detaching from the discrete grid in investment levels and optimal stopping times, we expect the problem to collapse into a continuous stochastic control problem, where one attains a continuous boundary $b$ corresponding to optimal control of the process $(U_{t})_{t\geq 0}$ , formal definition of which we leave for further research. Indeed, definition of the admissible class of controls and characterizing the boundary $b$ through suitable boundary conditions remains to be an ample opportunity for future research.

Acknowledgement.

We sincerely thank Erik Ekström for his patient, kind, and useful guidance and countless discussions which were invaluable in shaping this article to its current form.

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Carmona and S. Dayanik. Optimal multiple stopping of linear diffusions. Math. Oper. Res. , 33(2):446–460, 2008.
2[2] R. Carmona and N. Touzi. Optimal multiple stopping and valuation of swing options. Mathematical Finance , 18(2):239–268, 2008.
3[3] R. C. Dalang and A. N. Shiryaev. A quickest detection problem with an observation cost. Ann. Appl. Probab. , 25(3):1475–1512, 2015.
4[4] J.-P. Décamps, T. Mariotti, and S. Villeneuve. Irreversible investment in alternative projects. Econom. Theory , 28(2):425–448, 2006.
5[5] A. K. Dixit and R. S. Pindyck. Investment under Uncertainty . Princeton University Press, 1994.
6[6] J.-P. Décamps, T. Mariotti, and S. Villeneuve. Investment timing under incomplete information. Mathematics of Operations Research , 30(2):472–500, 2005.
7[7] E. Ekström and I. Karatzas. A sequential estimation problem with control and discretionary stopping. Probab. Uncertain. Quant. Risk , 7(3):151–168, 2022.
8[8] E. Ekström and A. Milazzo. A detection problem with a monotone observation rate. Stochastic Process. Appl. , 172:Paper No. 104337, 19, 2024.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Learning from the past in an irreversible investment problem

Abstract

1 Introduction

1.1 Related literature

1.2 Structure of the article

2 Problem set-up

Remark 2.1**.**

3 Finding a candidate solution

Lemma 3.1**.**

Proof.

Remark 3.2**.**

Remark 3.3**.**

Case n=1n=1n=1

Case 1<n≤N1<n\leq N1<n≤N

4 Study of the smooth fit equations

Lemma 4.1**.**

Proof.

Remark 4.2**.**

Assumption 4.3**.**

Proposition 4.4**.**

Proof.

Proposition 4.5**.**

Proof.

5 Main results

Proposition 5.1**.**

Proof.

Proposition 5.2**.**

Proof.

Proposition 5.3**.**

Proof.

Theorem 5.4**.**

Proof.

6 Comparative statistics

Remark 6.1**.**

Remark 6.2**.**

Acknowledgement**.**

Remark 2.1.

Lemma 3.1.

Remark 3.2.

Remark 3.3.

Case $n=1$

Case $1<n\leq N$

Lemma 4.1.

Remark 4.2.

Assumption 4.3.

Proposition 4.4.

Proposition 4.5.

Proposition 5.1.

Proposition 5.2.

Proposition 5.3.

Theorem 5.4.

Remark 6.1.

Remark 6.2.

Acknowledgement.