Strict monotonicity of principal eigenvalues of elliptic operators in $\mathbb{R}^d$ and risk-sensitive control
Ari Arapostathis, Anup Biswas, Subhamay Saha

TL;DR
This paper investigates the principal eigenvalues of elliptic operators in ^d, linking their strict monotonicity to ergodic properties and applying these insights to risk-sensitive control problems for diffusions.
Contribution
It characterizes the strict monotonicity of principal eigenvalues in relation to ergodic properties and extends results to equations with measurable coefficients, also establishing duality in ergodic control.
Findings
Strict monotonicity characterizes ergodic properties and uniqueness of ground states.
Established strong duality for ergodic control linear programming formulations.
Proved existence and optimality of Markov controls in risk-sensitive control problems.
Abstract
This paper studies the eigenvalue problem on for a class of second order, elliptic operators of the form , associated with non-degenerate diffusions. We show that strict monotonicity of the principal eigenvalue of the operator with respect to the potential function fully characterizes the ergodic properties of the associated ground state diffusion, and the unicity of the ground state, and we present a comprehensive study of the eigenvalue problem from this point of view. This allows us to extend or strengthen various results in the literature for a class of viscous Hamilton-Jacobi equations of ergodic type with smooth coefficients to equations with measurable drift and potential. In addition, we establish the strong duality for the equivalent infinite dimensional linear programming formulation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Strict monotonicity of principal eigenvalues of elliptic operators
in
and risk-sensitive control
Ari Arapostathis
Anup Biswas
Subhamay Saha
Department of ECE, The University of Texas at Austin, 2501 Speedway, EER 7.824, Austin, TX 78712, USA
Department of Mathematics, Indian Institute of Science Education and Research,
Dr. Homi Bhabha Road, Pune 411008, India
Department of Mathematics, Indian Institute of Technology Guwahati, Assam 781039, India
Abstract
This paper studies the eigenvalue problem on for a class of second order, elliptic operators of the form , associated with non-degenerate diffusions. We show that strict monotonicity of the principal eigenvalue of the operator with respect to the potential function fully characterizes the ergodic properties of the associated ground state diffusion, and the unicity of the ground state, and we present a comprehensive study of the eigenvalue problem from this point of view. This allows us to extend or strengthen various results in the literature for a class of viscous Hamilton–Jacobi equations of ergodic type with smooth coefficients to equations with measurable drift and potential. In addition, we establish the strong duality for the equivalent infinite dimensional linear programming formulation of these ergodic control problems. We also apply these results to the study of the infinite horizon risk-sensitive control problem for diffusions, and establish existence of optimal Markov controls, verification of optimality results, and the continuity of the controlled principal eigenvalue with respect to stationary Markov controls.
keywords:
generalized principal eigenvalue, recurrence and transience, viscous Hamilton–Jacobi equation, risk-sensitive control, ergodic control, nonlinear eigenvalue problems.
MSC:
[2010] Primary: 35P15, Secondary: 35B40, 35Q93, 60J60, 93E20
††journal: Journal de Mathématiques Pures et Appliquées
=
Contents
1 Introduction
In this paper we study the eigenvalue problem on for non-degenerate, second order elliptic operators of the form
[TABLE]
Here , and , satisfy a linear growth assumption in the outward radial direction (see (A2) in Subsection 1.1). In other words, and satisfy the usual assumptions for existence and uniqueness of a strong solution of the Itô equation
[TABLE]
where is a standard Brownian motion.
We focus on certain properties of the principal eigenvalue of the operator which play a key role in infinite horizon risk-sensitive control problems. When is a smooth bounded domain, and , , are regular enough, existence of a principal eigenvalue and corresponding eigenfunction under a Dirichlet boundary condition can be obtained by an application of Krein-Rutman theory (see for instance [1, 2]). This eigenvalue is the bottom of the spectrum of with Dirichlet boundary condition. For non-smooth domains, a generalized notion of a principal eigenvalue was introduced in the seminal work of Berestycki, Nirenberg and Varadhan [3]. An analogous theory for non-linear elliptic operators has been developed by Quaas and Sirakov in [4]. The principal eigenvalue plays a key role in the study of non-homogeneous elliptic operators and the maximum principle (see [3, 5, 6, 4]). For some other definitions of the principal (or critical) eigenvalue we refer the reader to the works of Pinchover [7] and Pinsky [2, Chapter 4].
For unbounded domains, principal eigenvalue problems have been recently considered by Berestycki and Rossi in [8, 5]. Not surprisingly, certain properties of the principal eigenvalue which hold in bounded domains may not be true for unbounded ones. For instance, when is smooth and bounded it is well known that for the Dirichlet boundary value problem, the principal eigenvalue is simple, and the associated principal eigenfunction is positive. Moreover, it is the unique eigenvalue with a positive eigenfunction. But if is unbounded and smooth, then there exists a constant such that any is an eigenvalue of with a positive eigenfunction [5, Theorem 1.4] (see also [6] and [9, Theorem 2.6]). The lowest such value serves as a definition of the principal eigenvalue when is not bounded. The principal eigenvalue is known to be strictly monotone as a function of the bounded domain (the latter ordered with respect to set inclusion), and also strictly monotone in the coefficient when the domain is bounded (see [5] and Lemma 2.1 below). These properties fail to hold in unbounded domains as remarked by Berestycki and Rossi [5, Remark 2.4]. Strict monotonicity of and its implications are a central theme in our study. We adopt a probabilistic approach in our investigation. One can view as a risk-sensitive average of over the diffusion in Eq. 1.2. More precisely, since Eq. 1.2 has a unique solution which exists for all , then we can define
[TABLE]
with ‘’ denoting the natural logarithm. As shown in the proof of Lemma 2.3 in [10] we have , and equality is indeed the case in many important situations, although strictly speaking it is only a heuristic. This heuristic is based on the fact that for a bounded , the operator is the infinitesimal generator of a strongly continuous, positive semigroup with potential , see for instance [11, Chapter IV]. If is a bounded continuous function, and if the occupation measures of obey a large deviation principle, then one can express in terms of the large deviation rate function. This is known as the variational representation for the eigenvalue. See for instance the article by Donsker and Varadhan [12] where this representation is obtained for compact domains. But large deviation principles for are generally available only under strong hypotheses on the process (see [13]). In this paper we rely on the stochastic representation of the principal eigenfunction which can be established under very mild assumptions. This approach has been recently used by Arapostathis and Biswas in [10] to study the multiplicative Poisson equation when is near-monotone (which includes the case of inf-compact ). By an eigenpair of we mean a pair , with a positive function in , for all , and , that satisfies
[TABLE]
We refer to as the eigenvalue, and to as the eigenfunction. In Eq. 1.4, and elsewhere in this paper, we adopt the notation and for , and use the standard summation rule that repeated subscripts and superscripts are summed from through .
As mentioned earlier, such a pair exists only if (see Corollary 2.1). Given an eigenpair , the associated twisted diffusion (a terminology used in [14]) is an Itô process as in Eq. 1.2, but with the drift replaced by . It is not generally the case that the twisted process has a strong solution which exists for all time. If the twisted diffusion is always transient (see Lemma 2.6). When , the eigenfunction is denoted as and is called the ground state [2, 15]. The corresponding twisted diffusion, denoted by , is referred to as the ground-state diffusion.
Let () denote the class of non-zero, nonnegative real valued continuous functions on which vanish at infinity (have compact support). We say that is strictly monotone at if there exists satisfying . We also say that is strictly monotone at on the right if for all . In Theorem 2.1 we show that strict monotonicity at implies strict monotonicity at on the right. Our main results provide sharp characterizations of the ground state and the ground state process in terms of these monotonicity properties. Assume that is a locally bounded, Borel measurable function, satisfying , and that is finite. We show that strict monotonicity of at on the right implies the simplicity of , i.e., the uniqueness of the ground state , and that this is also a necessary and sufficient condition for the ground state process to be recurrent (see Lemmas 2.7 and 2.3). Another important result is that the ground state diffusion is exponentially ergodic (see Definition 2.2) if and only if is strictly monotone at . These results are summarized in Theorem 2.1 in Section 2. Other results in Section 2 provide a characterization of the eigenvalue in terms of the long time behavior of the twisted process and stochastic representations of the ground state (see Lemmas 2.2, 2.3 and 2.7, and Theorem 2.6).
In [2], Pinsky uses the existence of a Green’s measure to define the critical eigenvalue of a non-degenerate elliptic operator. This critical eigenvalue coincides with the principal eigenvalue when the boundary of the domain and the coefficients of are smooth enough. He shows that for any bounded domain, and provided that the coefficients are in , , and bounded, there exists a critical value such that for any we can find a Green’s measure corresponding to the operator [2, Theorem 4.7.1]. The result in Theorem 2.3 in Section 2 extends this to without assuming much regularity on the coefficients.
Continuous dependence of on the coefficients of has also been a topic of interest. It is not hard to see that is lower-semicontinuous in the topology for . Continuity of this map is also established in [5, Proposition 9.2] with respect to the norm of . In Theorems 2.6 and 4.1 we study the continuity of for a class of functions under the topology. We also obtain a pinned multiplicative ergodic theorem which is of independent interest, and show that for a large class of problems.
We next discuss the connection of this problem with a stochastic ergodic control problem. Defining we obtain from Eq. 1.4 that
[TABLE]
It is easy to see that Eq. 1.5 is related to an ergodic control problem with controlled drift and running cost . The parameter can be thought of as the optimal ergodic value; see Ichihara [16]. Note then that the twisted process defined above corresponds to the optimally controlled diffusion. We refer to Ichihara [17, 16] and Kaise and Sheu [9] for some important results in this direction. For a potential that vanishes at infinity, Ichihara [16, 18] considers the ergodic control problem in Eq. 1.5, with a more general Hamiltonian and under scaling of the potential. When is nonnegative, it is shown that the value of the ergodic problem with potential , , equals the eigenvalue , and is the optimal control when the parameter exceeds a critical value , while below that critical value a bifurcation occurs. Analogous are the results in [19] for viscous Hamilton–Jacobi equations with the identity matrix and a Hamiltonian which is a power of the gradient term. Most of the above results are obtained for bounded, and Lipschitz continuous , , and . In Theorems 2.7 and 2.8 we extend these results to measurable and , and possibly unbounded and .
Optimality for the ergodic problem is shown in [16, 18] via the study of the optimal finite horizon problem (Cauchy parabolic problem). Inevitably, in doing so, optimality is shown in a certain class of controls. To overcome this limitation, we take a different approach to the ergodic control problem in Eq. 1.5. As well known, ergodic control problems can be cast as infinite dimensional linear programs [20, 21]. Consider a controlled diffusion, with the control taking values in a space with extended generator , where the ‘action’ enters implicitly as a parameter in . Let denote the running cost. The primal problem then can be written
[TABLE]
Here denotes the class of probability measures on the Borel -field of . Its elements are called ergodic occupation measures (see [20]). The dual problem takes the form
[TABLE]
where denotes the domain of . In other words the dual problem is a maximization over subsolutions of the Hamilton–Jacobi–Bellman (HJB) equation. For non-degenerate diffusions with a compact action space , under the hypothesis that is near-monotone, or under uniform ergodicity conditions, it is well known that we have strong duality, i.e., . To the best of our knowledge, this has not been established for problems with non-compact action spaces. In Theorem 2.9 we establish strong duality for the ergodic problem in Eq. 1.5. In this result, the coefficients and are bounded and measurable, and is bounded, Lipschitz, and uniformly elliptic. Moreover, we establish the unicity of the optimal ergodic occupation measure, and as a result of this, the uniqueness of the optimal stationary Markov control. The methodology is general enough that can be applied to various classes of ergodic control problems that are characterized by viscous HJB equations.
The results in [17, 9] are obtained for smooth coefficients (), and under an assumption of exponential ergodicity (see Eq. 3.12 below). We provide a sufficient condition in (H2) under which strict monotonicity of the principal eigenvalue holds. It is also shown that the exponential ergodicity condition of [17, 9] actually implies (H2); thus (H2) is weaker. Moreover, Eq. 3.12 cannot hold for bounded coefficients and . See Remark 3.4 for details. In Theorem 3.3 we cite a sufficient condition under which strict monotonicity of holds even when and are bounded. Let us also remark that the method of proof [17, 9] utilizes the smoothness of the coefficients , and . This is because a gradient estimate (Bernstein method) is required, which is not available under weaker regularity. But this amount of regularity might not be available in many situations, for instance in models with a measurable drift which are often encountered in stochastic control problems. Let us also mention the unpublished work of Kaise and Sheu in [22] that contains some results similar to ours, in particular, similar to the results in Section 3 and the pinned multiplicative ergodic theorems. These results are also obtained under sufficient smoothness of the coefficients , , and .
In Section 4 we apply the above mentioned results to study the infinite horizon risk-sensitive control problem. We refer the reader to [10] where the importance of these control problems is discussed. Unfortunately, the development of the infinite horizon risk-sensitive control problem for controlled diffusions has not been completely satisfactory, and the same applies to controlled Markov chain models. Most of the available results have been obtained under restrictive settings, and a full characterization such as uniqueness of the solution to the risk-sensitive HJB equation, and verification of optimality results is lacking. Let us give a quick overview of the existing literature on risk-sensitive control in the context of controlled diffusions which is relevant to our problem. Risk-sensitive control for models with a constant diffusion matrix and asymptotically flat drift is studied by Fleming and McEneaney in [23]. Another particular setting is considered by Nagai [24], where the action space is the whole Euclidean space, and the running cost has a specific structure. Menaldi and Robin have considered models with periodic data [25]. Under the assumption of a near-monotone cost, the infinite horizon risk-sensitive control problem is studied in [10, 26, 27], whereas Biswas in [28] has considered this problem under the assumption of exponential ergodicity. Differential games with risk-sensitive type costs have been studied by Basu and Ghosh [29], Biswas and Saha [30], and Ghosh et. al. [31]. All the above studies, have obtained existence of a pair that satisfies the risk-sensitive HJB equation, with the optimal risk-sensitive value, and show that any minimizing selector of the HJB is an optimal control. The works in [24, 25] address the existence and uniqueness of a solution to the HJB equation, in their particular set up, but do not contain any verification of optimality results. Two main results that are missing from the existing literature, with the exception of [10], are (a) uniqueness of the solution to the HJB equation, and (b) verification for optimal control.
Following the ergodic control paradigm, we can identify two classes of models: (i) models with a near-monotone running cost and finite optimal value, and no other hypotheses on the dynamics, and (ii) models that enjoy a uniform exponential ergodicity. Near-monotone running cost models are studied in [24, 10, 26, 27]; however, only [10] obtains a full characterization without imposing a blanket ergodicity hypothesis. Studies for models in class (ii) can be found in [23, 29, 28, 31].
In this paper we study models in class (ii). The results developed in Sections 2 and 3 enable us to obtain a full characterization of the risk-sensitive control problem in Section 4. The main hypotheses are Assumptions 4.1 and 4.2. Another interesting result that we establish in Section 4 is the continuity of the controlled principal eigenvalue with respect to (relaxed) stationary Markov controls (see Theorem 4.3). This facilitates establishing the existence of an optimal stationary Markov control for risk-sensitive control problems under risk-sensitive type constraints. Let us also remark that this existence result is far from being obvious, since the controlled risk-sensitive value is lower-semicontinuous with respect to Markov controls and the equality is not true in general. Moreover, the usual technique of Lagrange multipliers does not work in this situation, because of the non-convex nature of the optimization criterion.
To summarize the main contributions of the paper, we have established several characterizations of the property of strict monotonicity of the principal eigenvalue, and extended several results in the literature on viscous HJB equations with potentials vanishing at infinity, and smooth data, to measurable potential and drift (Theorems 2.1, 2.2, 2.6, 2.7, 2.8, 2.9, 2.10 and 3.2). We have also studied a general class of risk-sensitive control problems under a uniform ergodicity hypothesis, and established the uniqueness of a solution to the HJB equation and verification of optimality results (Theorems 4.1 and 4.2). Equally interesting are the continuity results of the controlled principal eigenvalue with respect to stationary Markov controls (Theorems 4.3 and 4.5).
The paper is organized as follows. Subsection 1.1 states the assumptions on the coefficients of the operator , and Subsection 1.2 summarizes the notation used in the paper. The first three subsections of Section 2 contain the main results on the principal eigenvalue under minimal assumptions, while Subsection 2.4 is devoted to operators with potential which vanishes at infinity. Section 3 improves on the results of Section 2, under the assumption that Eq. 1.2 is exponentially ergodic. Section 4 is dedicated to the infinite horizon, risk-sensitive optimal control problem.
1.1 Assumptions on the model
The following assumptions on the coefficients of are in effect throughout the paper unless otherwise mentioned.
- (A1)
Local Lipschitz continuity: The function \upsigma\;=\;\bigl{[}\upsigma^{ij}\bigr{]}\,\colon\,\mathbb{R}^{d}\to\mathbb{R}^{d\times d} is locally Lipschitz in with a Lipschitz constant depending on . In other words, with , we have
[TABLE]
We also assume that b\;=\;\bigl{[}b^{1},\dotsc,b^{d}\bigr{]}^{\mathsf{T}}\,\colon\,\mathbb{R}^{d}\to\mathbb{R}^{d} is locally bounded and measurable.
- (A2)
Affine growth condition: and satisfy a global growth condition of the form
[TABLE]
for some constant .
- (A3)
Nondegeneracy: For each , it holds that
[TABLE]
and for all , where, as defined earlier, .
Let us remark that the assumptions (A1)–(A3) are not optimal, and can be weakened in many situations. For instance, if is continuous and its weak derivative lies in , then Eq. 2.1 has a unique strong solution (see [32]). The results in this paper can be extended to this setup as well.
1.2 Notation
The standard Euclidean norm in is denoted by , and denotes the inner product. The set of nonnegative real numbers is denoted by , stands for the set of natural numbers, and denotes the indicator function. Given two real numbers and , the minimum (maximum) is denoted by (), respectively. The closure, boundary, and the complement of a set are denoted by , , and , respectively. We denote by the first exit time of the process from the set , defined by
[TABLE]
The open ball of radius in , centered at the origin, is denoted by , and we let , and .
The term domain in refers to a nonempty, connected open subset of the Euclidean space . For a domain , the space (), , refers to the class of all real-valued functions on whose partial derivatives up to order (of any order) exist and are continuous. Also, () is the class of functions whose partial derivatives up to order (of any order) are continuous and bounded in , and denotes the subset of , , consisting of functions that have compact support. In addition, denotes the class of continuous functions on that vanish at infinity. By and we denote the subsets of and , respectively, consisting of all non-trivial nonnegative functions. We use the term non-trivial to refer to a function that is not a.e. equal to [math]. The space , , stands for the Banach space of (equivalence classes of) measurable functions satisfying , and is the Banach space of functions that are essentially bounded in . The standard Sobolev space of functions on whose generalized derivatives up to order are in , equipped with its natural norm, is denoted by , , . For a probability measure in and a real-valued function which is integrable with respect to we use the notation
[TABLE]
In general, if is a space of real-valued functions on , consists of all functions such that for every . In this manner we obtain for example the space .
We often use Krylov’s extension of the Itô formula for functions in [33, p. 122], which we refer to as the Itô–Krylov formula.
2 General results
Let be a given filtered probability space with a complete, right continuous filtration . Let be a standard Brownian motion adapted to . Consider the stochastic differential equation
[TABLE]
The third term on the right hand side of Eq. 2.1 is an Itô stochastic integral. We say that a process is a solution of Eq. 2.1, if it is -adapted, continuous in , defined for all and , and satisfies Eq. 2.1 for all a.s. It is well known that under (A1)–(A3), there exists a unique solution of Eq. 2.1 [34, Theorem 2.2.4]. We let denote the expectation operator on the canonical space of the process with , and the corresponding probability measure. Recall that denotes the first exit time of the process from a domain . The process is said be recurrent if for any bounded domain we have for all . Otherwise the process is called transient. A recurrent process is said to be positive recurrent if for all . It is known that for a non-degenerate diffusion the property of recurrence (or positive recurrence) is independent of and , i.e., if it holds for some domain and , then it also holds for every domain , and all points (see [34, Lemma 2.6.12 and Theorem 2.6.10]). We define the extended operator associated to Eq. 2.1 by
[TABLE]
Let be a locally bounded, Borel measurable function, which is bounded from below in , i.e., . We refer to a function with these properties as a potential, and let .
2.1 Risk-sensitive value and Dirichlet eigenvalues
The following lemma summarizes some results from [3, 5, 4] on the eigenvalues of the Dirichlet problem for the operator . For simplicity, we state it for balls , instead of more general domains.
Lemma 2.1
For each there exists a unique pair (\widehat{\Psi}_{r},\hat{\lambda}_{r})\in\bigl{(}\mathscr{W}_{\mathrm{loc}}^{2,p}(B_{r})\cap C(\bar{B}_{r})\bigr{)}\times\mathbb{R}, for any , satisfying on , on , and , which solves
[TABLE]
with as defined in Eq. 2.2. Moreover, has the following properties:
The map is continuous and strictly increasing. 2. 2.
In its dependence on the function , is nondecreasing, convex, and Lipschitz continuous (with respect to the norm), with Lipschitz constant . In addition, if , then .
Proof 1
Existence and uniqueness of the solution follow by [4, Theorem 1.1] (see also [3]). Part (a) follows by [5, Theorem 1.10], and (iii)–(iv) of [5, Proposition 2.3], while part (b) follows by [3, Proposition 2.1]. \qed
We refer to as the eigensolution of the Dirichlet problem, or the Dirichlet eigensolution of on . Correspondingly, and are referred to as the Dirichlet eigenvalue and Dirichlet eigenfunction, respectively.
Lemma 2.1 (a) motivates the following definition.
Definition 2.1
Let be a potential. The principal eigenvalue on of the operator given in Eq. 1.1 is defined as .
For a potential we also define
[TABLE]
We refer to as the risk-sensitive average of . This quantity plays a key role in our analysis.
We also compare Definition 2.1 with the following definition of the principal eigenvalue, commonly used in the pde literature [5].
[TABLE]
The following hypothesis is enforced throughout Section 2 without further mention, and it is repeated only for emphasis.
- (H1)
is a potential, and is finite.
Lemma 2.2
The following hold
For any , the Dirichlet eigensolutions in Eq. 2.3 have the following stochastic representation
[TABLE]
for all large enough . 2. 2.
It holds that . 3. 3.
Let be any limit point of the Dirichlet eigensolutions as , and be an open ball centered at [math] such that for some bounded function . Then with denoting the first hitting time of we have
[TABLE]
Proof 2
Part (i) follows from [10, Lemma 2.10 (i)].
Turning to part (ii), suppose that is finite. Then it is standard to show that there exists a positive which satisfies
[TABLE]
See [10, 26] for instance. It is then clear that .
To show the converse inequality, suppose that a pair , with , satisfies
[TABLE]
We claim that . If not, then we can find a pair as in by Lemma 2.1, satisfying Eq. 2.3 and . By the Itô–Krylov formula [33, p. 122] we have
[TABLE]
Since is positive, Eqs. 2.6 and 2.10 imply that we can scale it by multiplying with a constant so that attains it minimum in and this minimum value is [math]. Combining Eqs. 2.3 and 2.9, we obtain
[TABLE]
It then follows by the strong maximum principle [35, Theorem 9.6] that in , which is not possible since on . This proves the claim. Since was arbitrary, this implies that , and thus we have equality.
It remains to prove Eq. 2.7. We follow the same argument as in [10, Lemma 2.10]. We fix . Letting in Eq. 2.6 and applying Fatou’s lemma we obtain
[TABLE]
Thus, with denoting a solution of Eq. 2.8, with replaced by and , we also have
[TABLE]
which implies that
[TABLE]
since in . We write Eq. 2.6 as
[TABLE]
Note that since , the first term on the right hand side of Eq. 2.13 is finite by Eq. 2.12 for all large enough . Let
[TABLE]
The second term on the right hand side of Eq. 2.13 has the bound
[TABLE]
By the convergence of as , uniformly on compact sets, and since is bounded away from [math] in , uniformly in , by Harnack’s inequality, we have as . Therefore, the second term on the right hand side of Eq. 2.13 vanishes as . Also, since is nondecreasing in , and , we obtain
[TABLE]
by Eq. 2.12 and dominated convergence. Thus taking limits in Eq. 2.13 as , and using Eqs. 2.14 and 2.11, we obtain Eq. 2.7. This completes the proof. \qed
Combining Lemma 2.2 (ii) and [5, Theorem 1.4] we have the following result.
Corollary 2.1
There exists a positive , , satisfying
[TABLE]
if and only if .
As also mentioned in the introduction, throughout the rest of the paper, by an eigenpair of we mean a positive function and a scalar that satisfy Eq. 2.15. In addition, the eigenfunction is assumed to be normalized as , unless indicated otherwise. When is the principal eigenvalue, we refer to as a principal eigenpair. Note, that in view of the assumptions on the coefficients, any which satisfies Eq. 2.15 belongs to , for all . Therefore, in the interest of notational economy, we refrain from mentioning the function space of solutions of equations of the form Eq. 2.15, and any such solution is meant to be in . Moreover, since these are always strong solutions, we often suppress the qualifier ‘a.e.’, and unless a different domain is specified, such equations or inequalities are meant to hold on .
2.2 Summary of results
A major objective in this paper is to relate the properties of the eigenvalues in Eq. 2.15 to the recurrence properties of the twisted process which is defined as follows. For an eigenfunction satisfying Eq. 2.15 we let . Then we can write Eq. 2.15 as
[TABLE]
The twisted process corresponding to an eigenpair of is defined by the SDE
[TABLE]
Since , , it follows that is locally bounded (in fact it is locally Hölder continuous), and therefore Eq. 2.17 has a unique strong solution up to its explosion time. We let denote the extended generator of Eq. 2.17, and the associated expectation operator. The reader might have observed that the twisted process corresponds to Doob’s -transformation of the operator with .
With denoting a principal eigenfunction, i.e., an eigenfunction associated with , we let , and denote by the corresponding twisted process. A twisted process corresponding to a principal eigenpair is called a ground state process, and the eigenfunction is called a ground state.
Recall that denotes the collection of all non-trivial, nonnegative, continuous functions which vanish at infinity. We consider the following two properties of .
- (P1)
Strict monotonicity at . For some we have .
- (P2)
Strict monotonicity at on the right. For all we have .
It follows by the convexity of that (P1) implies (P2).
Later, in Section 3, we provide sufficient conditions under which (P1) holds. Also, the finiteness of and is implicit in (P1). Indeed, since for every positive , and we have
[TABLE]
it follows that and are either both finite, or both equal to . It is also clear that always hold. As shown in Theorem 2.2, (P1) implies that for all .
We introduce the following definition of exponential ergodicity which we often use.
Definition 2.2** (exponential ergodicity)**
The process governed by Eq. 1.2 is said to be exponentially ergodic if for some compact set and we have \operatorname{\mathbb{E}}_{x}\bigl{[}\mathrm{e}^{\delta\,\uptau(\mathscr{B}^{c})}\bigr{]}<\infty, for all .
The main results of this section center around the following theorem.
Theorem 2.1
Under (H1), the following hold:
A ground state process is recurrent if and only if is strictly monotone at on the right, in which case the principal eigenvalue is also simple, and the ground state satisfies
[TABLE] 2. 2.
The ground state process is exponentially ergodic if and only if is strictly monotone at . 3. 3.
If , the twisted process Eq. 2.17 corresponding to any solution of Eq. 2.16 is transient.
Proof 3
Part (a) follows by Lemmas 2.7, 2.3 and 2.3. Part (b) is the statement of Theorem 2.2, while part (c) is shown in Lemma 2.6. \qed
Theorem 2.1 should be compared with the results in [17, Theorem 2.2] and [9, Theorem 3.2 and 3.7]. The results in [17, 9] are obtained under a stronger hypothesis (same as Eq. 3.12 below) and for sufficiently regular coefficients. For a similar result in a bounded domain we refer the reader to [2, Theorem 4.2.4], where results are obtained for a certain class of operators with regular coefficients.
We remark that (P1) does not imply that the underlying process in Eq. 2.1 is recurrent. Indeed consider a one-dimensional diffusion with and , and let . Then Eq. 2.15 holds with and . But , so the twisted process is exponentially ergodic, while the original diffusion is transient.
The proof of Theorem 2.1 is divided in several lemmas which also contain results of independent interest. These occupy the next section.
2.3 Proof of Theorem 2.1 and other results
In the sequel, we often use the following finite time representation. This also appears in [10, Lemma 2.4] but in a slightly different form. Let where denotes the exit time from the ball . Recall that if is an eigenpair of , and , then denotes the expectation operator associated with the twisted process in Eq. 2.17.
Lemma 2.3
If is an eigenpair of , then
[TABLE]
and for any function , where is the corresponding twisted process defined by Eq. 2.17.
Proof 4
The equation in Eq. 2.19 can be obtained by applying the Cameron–Martin–Girsanov theorem [36, p. 225]. Since and are not bounded, we need to localize the martingale. We use the first exit times from as localization times. It is well-known that assumption (A2) implies that as -a.s. Applying the Itô–Krylov formula and using Eq. 2.16, we obtain
[TABLE]
Let be any nonnegative, continuous function with compact support. Then from 4 we obtain
[TABLE]
where in the last line we use Girsanov’s theorem. Given any bounded ball , by Itô’s formula and Fatou’s lemma, we obtain from Eq. 2.15 that
[TABLE]
Therefore, if we write
[TABLE]
we deduce that the first term on the right hand side is equal to [math] for all sufficiently large since is compactly supported, while the second term converges as to the right hand side of Eq. 2.19 by Eq. 2.22 and dominated convergence. In addition, since has compact support, the term inside the expectation in the right hand side of 4 is bounded uniformly in . Since also \widetilde{\operatorname{\mathbb{E}}}_{x}^{\psi}\bigl{[}g(Y_{\uptau_{n}})\,\Psi^{-1}(Y_{\uptau_{n}})\bigr{]}=0 for all sufficiently large , letting in 4, we obtain
[TABLE]
This proves Eq. 2.19. \qed
Recall that . An immediate corollary to Lemma 2.3 is the following.
Corollary 2.2
With as in Lemma 2.3, we have
[TABLE]
Proof 5
Choose a sequence of cut-off functions that approximates unity from below. Then Eq. 2.19 holds with replaced by . Thus the result follows by letting and applying the monotone convergence theorem. \qed
We are now ready to prove uniqueness of the principal eigenfunction.
Lemma 2.4
Under (P1) there exists a unique ground state for , i.e., a positive , , which solves
[TABLE]
Proof 6
Let be a solution of Eq. 2.23 obtained as a limit of (see Lemma 2.2). Thus by Lemma 2.2 (iii) we can find an open ball such that
[TABLE]
with . Suppose that is another principal eigenfunction of . By the Itô–Krylov formula and Fatou’s lemma, and since is positive on , we obtain
[TABLE]
It is clear by 6 that if on , then on . Therefore, we can scale by multiplying it with so that touches from above in at the points . Denoting this scaled also as , it follows from 6 that is nonnegative in , and its minimum is [math] and attained in . On the other hand, we have
[TABLE]
Thus by the strong maximum principle [35, Theorem 9.6], and this proves the result. \qed
We next show that (P1) implies the exponential ergodicity of .
Lemma 2.5
Assume (P1). Let be the ground state of , and . Then the ground state process governed by
[TABLE]
is exponentially ergodic. In particular, is positive recurrent.
Proof 7
We first show that the finite time representation of holds. Let , and be a ball as in Lemma 2.2 (iii). Recall that as , and therefore, we have for all sufficiently large . Consider the following equations
[TABLE]
Choose large enough so that . We can scale , by multiplying it with a positive constant, so that touches from above. Next we show that it can only touch in . Note that in we have
[TABLE]
Therefore, by the strong maximum principle, if attains its minimum in , then in , which is not possible. Thus touches in . Thus, applying Harnack’s inequality we can find a constant such that for all sufficiently large . On the other hand, by the Itô–Krylov formula and Fatou’s lemma we know that
[TABLE]
Applying the Itô–Krylov formula to Eq. 2.3 we have
[TABLE]
and letting , using Eq. 2.26 and the dominated convergence theorem, we obtain
[TABLE]
where is the unique solution of Eq. 2.23. This proves the finite time representation. Thus it follows from Corollary 2.2 that Eq. 2.25 is regular, i.e., .
If we define , a straightforward calculation shows that
[TABLE]
for some positive constants and . Recall from Lemma 2.2 (iii). It is easy to see from Eq. 2.7 that
[TABLE]
Thus is uniformly bounded from below by a positive constant. Since in Eq. 2.25 is regular, the Foster–Lyapunov inequality in Eq. 2.27 implies that is exponentially ergodic. \qed
We denote the invariant measure of Eq. 2.25 by . The following lemma shows that the twisted process is transient for any .
Lemma 2.6
Let be an eigenfunction of for an eigenvalue . Then the corresponding twisted process is transient.
Proof 8
Let . If , then there is nothing to prove. So we assume the contrary. Hence from Lemma 2.3 we have
[TABLE]
for any continuous with compact support. Let . By the Itô–Krylov formula and Fatou’s lemma, we have
[TABLE]
Thus, for , we obtain
[TABLE]
Combining Eqs. 2.28 and 2.29, we have
[TABLE]
Therefore, is transient. \qed
Theorem 2.2
The following are equivalent.
The process , defined in Eq. 2.25, corresponding to some principal eigenpair \bigl{(}\Psi^{*},\lambda^{\!*}(f)\bigr{)} is exponentially ergodic. 2. 2.
It holds that for all . 3. 3.
It holds that for some .
Proof 9
(iii)(i) follows from Lemma 2.5, and (ii)(iii) is obvious.
We show that (i)(ii). If is exponentially ergodic, then there exists a ball and such that
[TABLE]
Mimicking the calculations in the proof of Lemma 2.3, we obtain that
[TABLE]
for . We apply this equation to an increasing sequence which converges to , and let first , and then , using Fatou’s lemma and the exponential ergodicity of , to obtain
[TABLE]
Let . Since is bounded, it is easy to see that is finite. Let , and \bigl{(}\tilde{\Psi}^{*},\lambda^{\!*}(\tilde{f})\bigr{)} be a solution of
[TABLE]
which is obtained as a limit of Dirichlet eigensolutions as in Lemma 2.2. If , then in view of Eq. 2.30 and the calculations in the proof of Lemma 2.2 (iii), we have
[TABLE]
Applying the Itô–Krylov formula and Fatou’s lemma to Eq. 2.23, we obtain
[TABLE]
It follows by Eqs. 2.32 and 2.33 that we can multiply with a suitable positive constant so that attains a minimum of [math] in . On the other hand, from Eqs. 2.23 and 2.31 we have
[TABLE]
Thus by strong maximum principle we have . This, in turn, implies that by Eq. 2.34. But this is not possible. Hence we have , and the proof is complete. \qed
We define the Green’s measure , , by
[TABLE]
The density of the Green’s measure with respect to the Lebesgue measure is called the Green’s function. Existence of a Green’s function (and Green’s measure) is used by Pinsky [2, Chapter 4.3] in his definition of the generalized principal eigenvalue of . A number is said to be subcritical if possesses a density, critical if it is not subcritical and has a positive solution , and supercritical if it is neither subcritical nor critical.
The lemma which follows is an extension of [2, Theorem 4.3.4] where, under a regularity assumption on the coefficients, it is shown that a critical eigenvalue is always simple. This result establishes several equivalences of the notion of criticality of .
Lemma 2.7
The following are equivalent.
The twisted process corresponding to the eigenpair is recurrent. 2. 2.
* is infinite for some .* 3. 3.
For some open ball , and with , we have
[TABLE]
where is an eigenfunction corresponding to the eigenvalue .
In addition, in (ii)–(iii)* “some” may be replaced by “all”, and if any one of (i)–(iii) holds, then is a simple eigenvalue.*
Proof 10
The argument of this proof is inspired from [10, Theorem 2.8]. By Corollary 2.1 we have . Assume that (i) holds for some . Let be an eigenpair of . Then for any we have from Lemma 2.3 that
[TABLE]
On the other hand, if is recurrent, then
[TABLE]
Combining this with Eq. 2.35 we have . Hence (ii) follows.
Next suppose that (ii) holds, i.e., for some and . Applying the Itô–Krylov formula to , we have
[TABLE]
for all , and for any bounded ball . Define , and
[TABLE]
for , and some . From Eq. 2.36 we have for all . Moreover, as by hypothesis. Choose large enough so that . Following [10, Theorem 2.8] we consider the positive solution of
[TABLE]
for . Since for every fixed we have
[TABLE]
by Eq. 2.36, applying the Itô–Krylov formula to Eq. 2.37, we obtain by [10, Theorem 2.8] that
[TABLE]
Since is bounded uniformly on by hypothesis, we can apply Harnack’s inequality for a class of superharmonic functions [37, Corollary 2.2] to conclude that is locally bounded, and therefore also uniformly bounded in , , for any . Thus, we have that weakly in along some subsequence, and that satisfies
[TABLE]
by Eq. 2.37. Let be an open ball centered at [math] such that . Applying the Itô–Krylov formula to Eq. 2.37 we obtain
[TABLE]
with . As in the derivation of Eq. 2.38, using Eq. 2.36 and a similar argument we obtain
[TABLE]
Letting along some subsequence, and arguing as above, we obtain a function which satisfies Eq. 2.39 and
[TABLE]
where Eq. 2.41 follows from Eq. 2.40. From Eq. 2.38 we have for all . Now applying Harnack’s inequality once again and letting , we deduce that converges weakly in , , to some positive function which satisfies in , and
[TABLE]
This implies (iii).
Lastly, suppose that (iii) holds. In other words, there exists an eigenpair and an open ball such that
[TABLE]
We first show that is a simple eigenvalue, which implies that there is a unique twisted process corresponding to . To establish the simplicity of consider another eigenpair of . By the Itô–Krylov formula we obtain
[TABLE]
Thus using Eq. 2.42 and an argument similar to Lemma 2.4 we can show that . Then (iii)(i) follows from [10, Lemma 2.6].
Uniqueness of the eigenfunction follows from the stochastic representation in Eq. 2.42 and the proof of (iii)(i). \qed
As an immediate corollary to Lemmas 2.6 and 2.7 we have the following.
Corollary 2.3
Let be an eigenpair of which satisfies
[TABLE]
for some bounded open ball in . Then , and it is a simple eigenvalue.
Theorem 2.3 below is a generalization of [2, Theorem 4.7.1] in , which is stated in bounded domains, and for bounded and smooth coefficients. It is shown in [2] that for smooth bounded domains, the Green’s measure is not defined at the critical value [2, Theorem 3.2]. But by Theorem 2.3 below we see that this is not the case on . In fact, [2, Theorem 4.3.2] shows that could be either subcritical or critical in the sense of Pinsky. We show that the criticality of is equivalent to the strict monotonicity of on the right, i.e., for all .
Theorem 2.3
A ground state process is recurrent if and only if for all .
Proof 11
Suppose first that a ground state process corresponding to is recurrent. Then for all by Lemma 2.7. Let and . Suppose that . Let be a principal eigenfunction of , i.e.,
[TABLE]
Writing Eq. 2.43 as , and applying the Itô–Krylov formula, followed by Fatou’s lemma, we obtain
[TABLE]
which contradicts the property that for all . Therefore, for all .
To prove the converse, suppose that is transient. Then for with we have . Following the arguments in the proof of (ii)(iii) in Lemma 2.7, we obtain a positive satisfying
[TABLE]
Let . Then from Eq. 2.44 we have
[TABLE]
This implies that by Lemma 2.2 (ii). Thus . Therefore, if for all , then has to be recurrent. This completes the proof. \qed
It is well known that a (null) recurrent diffusion with locally uniformly elliptic and Lipschitz continuous , and locally bounded measurable drift, admits a -finite invariant probability measure which is a Radon measure on the Borel -field of [38]. This measure is equivalent to the Lebesgue measure and is unique up to a multiplicative constant. Theorem 8.1 in [38] states that if and are real-valued functions which are integrable with respect to the measure then
[TABLE]
Suppose is a non-trivial function. Select as the indicator function of some open ball. Then it is well known that the expectation of tends to as . Adopt the analogous notation , and let . Let be arbitrary, and select large enough such that \operatorname{\mathbb{E}}\bigl{[}Y^{h}_{t_{0}}\bigr{]}\geq 2M. Then of course we may find a positive constant such \operatorname{\mathbb{E}}\bigl{[}Y^{h}_{t_{0}}\,\mathds{1}_{\{Y^{h}_{t_{0}}\leq\kappa\}}\bigr{]}\geq M. Since and are nondecreasing in , it follows by Eq. 2.45 that
[TABLE]
This of course implies, using dominated convergence, that \liminf_{t\to\infty}\,\operatorname{\mathbb{E}}\bigl{[}Y^{g}_{t}\,\mathds{1}_{\{Y^{h}_{t_{0}}\leq\kappa\}}\bigr{]}\geq\alpha M. Since was arbitrary, this shows that \operatorname{\mathbb{E}}\bigl{[}Y^{g}_{t}\bigr{]}\to\infty as , or equivalently that . Using this property in the proof of Theorem 2.3 we obtain the following corollary.
Corollary 2.4
For to be strictly monotone at on the right it is sufficient that there exists some non-trivial Borel measurable bounded function with compact support satisfying for all .
2.3.1 Minimal growth at infinity
We next discuss the property known as minimal growth at infinity [5, Definition 8.2]. As shown in [5, Proposition 8.4], minimal growth at infinity implies that the eigenspace corresponding to the eigenvalue is one dimensional, i.e., is simple. We start with the following definition, which is a variation of [5, Definition 8.2].
Definition 2.3
A positive function is said to be a solution of minimal growth at infinity of , if for any and any positive function satisfying a.e., in , there exists and such that in .
Define the generalized principal eigenvalue of in the domain by
[TABLE]
Note that . It is also clear from this definition that for we have .
It is shown in [5, Theorem 8.5] that the hypothesis
- (A1)
implies that the ground state of is a solution of minimal growth at infinity.
On the other hand, the following result has been established in [39, Theorem 2.1].
Theorem 2.4
The ground state of is a solution of minimal growth at infinity of if and only if is strictly monotone at on the right.
It thus follows by the above results that (A1) is a sufficient condition for strict monotonicity of on the right. It turns out that (A1) is equivalent to strict monotonicity and, moreover, the map is either negative on or identically equal to [math]. This is the subject of the following theorem.
Theorem 2.5
The following are equivalent.
. 2. 2.
* is strictly monotone at .* 3. 3.
.
Proof 12
It easily follows by Lemmas 2.5 and 2.2 and the definition of that (b)(c). Thus it remains to prove that (a)(b). Suppose that for some . Using the Dirichlet eigenvalues for the annulus , for , and letting , we can construct a solution of on , with on , and on . Then is bounded away from [math] on for all . Using any , we extend smoothly inside to obtain some function which is strictly positive on and agrees with on . Let , and . Then , and therefore, we have
[TABLE]
which implies strict monotonicity at , and completes the proof. \qed
2.4 Potentials vanishing at infinity
Let denote the class of bounded Borel measurable functions which are vanishing at infinity, i.e., satisfying , and the class of nonnegative functions in which are not a.e. equal to [math].
Theorem 2.6 which follows is a (pinned) multiplicative ergodic theorem (compare with [22, Theorem 7.1]). Note that the continuity result in this theorem is stronger than that of [5, Proposition 9.2]. See also Remark 4.1 on the continuity of for a larger class of . We introduce the eigenvalue defined by
[TABLE]
Theorem 2.6
Let . If the solution of Eq. 2.1 is recurrent, then . In addition, if the solution of Eq. 2.1 is positive recurrent with invariant measure , and , the following hold:
- (a)
for any measurable with compact support we have
[TABLE]
for some positive constant . Moreover, the corresponding twisted process is exponentially ergodic.
- (b)
If is a sequence of functions in satisfying , and converging to in , and also uniformly outside some compact set , then .
Proof 13
Applying the Itô–Krylov formula to , it is easy to see that . Also, from [10, Lemma 2.3] we have . Thus we obtain . If , then by [5, Theorem 1.9 (iii)] we have which in turn implies that . On the other hand, if , then is near-monotone, relative to , in the sense of [10]. Applying [10, Lemma 2.1] we again obtain .
We now turn to part (a). Applying Jensen’s inequality it is easy to see that . Therefore, . Taking and mimicking the arguments of Theorem 2.1 we see that is exponentially ergodic. Let be the unique invariant measure of . Then Eq. 2.47 follows from Eq. 2.28 and [40, Theorem 1.3.10] with .
Next we prove part (b). By the first part of the theorem we have for all , and by the lower-semicontinuity property of it holds that . Let and . Then by Theorem 2.2 we have . Choose a open ball , containing , such that and for all sufficiently large . Let denote the principal eigenpair. Then
[TABLE]
We can choose large enough such that
[TABLE]
where . Suppose . It is standard to show that for some positive , it holds that weakly in , , as , and therefore, from Eq. 2.48 we have
[TABLE]
Therefore, . Note that on we have
[TABLE]
for all sufficiently large. Since \operatorname{\mathbb{E}}_{x}\bigl{[}\mathrm{e}^{\int_{0}^{\breve{\uptau}}[f(X_{t})-\lambda^{\!*}(\tilde{f})]\,\mathrm{d}{t}}\,\bigr{]}<\infty, passing to the limit in Eq. 2.49, and using the dominated convergence theorem, we obtain that
[TABLE]
Therefore, by Corollary 2.3. This completes the proof. \qed
We pause for a moment to provide an example where (P2) holds but (P1) fails.
Example 2.1
Let and . If , then the ground state is a constant function, and in turn, the ground state diffusion is a two dimensional Brownian motion, hence recurrent. It follows that is strictly monotone on the right at [math]. Now let a non-trivial non-negative continuous function with compact support. It is clear that for . On the other hand, by Theorem 2.6, we have for all . Therefore, for , we have
[TABLE]
Thus for all , which implies that it is not strictly monotone at [math].
In the rest of this section we show how the previous development can be used to obtain results analogous to those reported in [16], without imposing any smoothness assumptions on the coefficients. With , we have
[TABLE]
Note that Eq. 2.50 is a particular form of a more general class of quasilinear pdes of the form
[TABLE]
where the function , with , serves as a Hamiltonian. Let be a non-constant, nonnegative continuous function satisfying , and define , . Then by [5, Proposition 2.3 (vii)] we know that is non-decreasing and convex. For the diffusion matrix equal the identity, Ichihara studies some qualitative properties of in [16] associated to the pde Eq. 2.51, and their relation to the recurrence and transience behavior of the process with generator
[TABLE]
It is clear that if , then is the generator of the twisted process corresponding to . One of the key assumptions in [16, Assumption (H1) (i)] is that for all and . Note that this forces to be [math].
Let
[TABLE]
It is easy to see that . The following result is an extension of [16, Theorems 2.2 and 2.3] to measurable drifts and potentials .
Theorem 2.7
Let . Then the twisted process corresponding to the eigenpair is transient for , exponentially ergodic for , and, provided a.e. outside some compact set, it is recurrent for . In addition, the following hold.
*If is self-adjoint *(i.e., **), with the matrix bounded, uniformly elliptic and radially symmetric in , and the solution of Eq. 2.1 is transient, then . Also for all . 2. 2.
Provided that the solution of Eq. 2.1 is recurrent, then if it is exponentially ergodic, and otherwise. 3. 3.
Assume that , and that Eq. 2.1 is recurrent in the case that . Let and denote the ground state and the invariant probability measure of the ground state diffusion, respectively, corresponding to . Then it holds that
[TABLE]
where, as usual, .
Proof 14
The first part of the proof follows from Theorems 2.1 and 2.3, and Corollary 2.4. Next we proceed to prove (i). Suppose . Then , i.e., the twisted process corresponding to , is exponentially ergodic. By [5, Theorem 1.9 (i)–(ii)] we have . Moreover, is a ground state. Therefore, the twisted process must be given by Eq. 2.1, which is transient by hypothesis. This is a contradiction. Hence . Since is convex, it follows that is constant in . Hence for . This proves (i).
We now turn to part (ii). By Theorem 2.6 we have . We claim that if the solution of Eq. 2.1 is recurrent then , whenever . Indeed, arguing by contradiction, if for some , then on , which implies that that is a nonnegative supermartingale, and since it is integrable, it converges a.s. Since the process is recurrent, this implies that must equal to a constant, which, in turn, necessitates that , a contradiction. This proves the claim, which in turn implies that if the solution of Eq. 2.1 is recurrent then . Now suppose that is negative. Then the twisted process corresponding to is exponentially ergodic by Theorem 2.1. Since , the ground state diffusion for agrees with Eq. 2.1, which implies that the latter is exponentially ergodic.
Next, suppose that , and therefore also is exponentially ergodic. It then follows from Theorem 2.2 that is strictly monotone at [math]. This of course implies that . The proof of part (ii) is now complete.
Next we prove part (iii). We distinguish two cases.
Case 1.* Suppose . Let {\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}={\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}_{\beta}\coloneqq(\Psi^{*}_{\beta})^{-1} and \breve{\psi}\coloneqq\log{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}. Then *
- satisfies*
[TABLE]
Since , there exists and a ball such that for all . Applying the Feynman–Kac formula, it follows from **[10*, Lemma 2.1]** that \inf_{\mathbb{R}^{d}}\,{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}=\min_{\bar{\mathscr{B}}}\,{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}. Thus *
- is bounded away from [math] on . Let denote the ground state process corresponding to the eigenvalue . Simplifying the notation we let . By the exponential Foster–Lyapunov equation Eq. 2.53 we have that (see [34, Lemma 2.5.5])*
[TABLE]
Using this estimate together with the fact that \inf_{{\mathbb{R}^{d}}}{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}>0, we obtain
[TABLE]
Next, we show that
[TABLE]
where denotes the exit time from the ball . First, there exists some constant such that \bigl{(}\beta f-\Lambda_{\beta}\bigr{)}{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}\leq k_{0} on . Thus \widetilde{\operatorname{\mathbb{E}}}_{x}^{*}\bigl{[}{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}(Y^{*}_{t})\bigr{]}\leq k_{0}t+{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}(x) by Eq. 2.53, and of course also \widetilde{\operatorname{\mathbb{E}}}_{x}^{*}\bigl{[}{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}(Y^{*}_{t\wedge\uptau_{R}})\bigr{]}\leq k_{0}t+{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}(x) for all . Let for . Then
[TABLE]
Taking limits as , and since is arbitrary, it follows that
[TABLE]
Write
[TABLE]
Without loss of generality we assume {\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}\geq 1. Since \lvert\breve{\psi}\rvert\leq{\mathchoice{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\displaystyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\displaystyle\Psi}}}}{{\ooalign{\hbox{\raise 7.09259pt\hbox{\scalebox{1.0}[-1.0]{\lower 7.09259pt\hbox{\textstyle\widehat{\vrule width=0.0pt,height=6.83331pt\vrule height=0.0pt,width=7.7778pt}}}}}\cr\hbox{\textstyle\Psi}}}}{{\ooalign{\hbox{\raise 6.40926pt\hbox{\scalebox{1.0}[-1.0]{\lower 6.40926pt\hbox{\scriptstyle\widehat{\vrule width=0.0pt,height=4.78333pt\vrule height=0.0pt,width=5.44446pt}}}}}\cr\hbox{\scriptstyle\Psi}}}}{{\ooalign{\hbox{\raise 5.9537pt\hbox{\scalebox{1.0}[-1.0]{\lower 5.9537pt\hbox{\scriptscriptstyle\widehat{\vrule width=0.0pt,height=3.41666pt\vrule height=0.0pt,width=3.8889pt}}}}}\cr\hbox{\scriptscriptstyle\Psi}}}}}, an application of Fatou’s lemma shows that
[TABLE]
We use this together with Eqs. 2.57 and 2.58 to obtain Eq. 2.56.
We write Eq. 2.50 as
[TABLE]
Let . Applying the Itô–Krylov formula to 14, we obtain
[TABLE]
Letting in Eq. 2.60, using Eq. 2.56, then dividing by and letting , using Eq. 2.55 and Birkhoff’s ergodic theorem, we obtain
[TABLE]
which is the assertion in part (iii).
Case 2.* Suppose and Eq. 2.1 is recurrent. The case is then trivial, since , so we assume that . Then Eq. 2.1 is exponentially ergodic by part (ii), and thus is bounded below in by [10, Lemma 2.1]. With , in analogy to 14 we have*
[TABLE]
We claim that
[TABLE]
where as defined earlier, , and denotes the ground state process. Assuming Eq. 2.62 is true, we first apply the Itô–Krylov formula to Eq. 2.61 to obtain the analogous equation to Eq. 2.60, and then take limits and use Birkhoff’s ergodic theorem to establish Eq. 2.52.
It remains to prove Eq. 2.62. Choose so that , and let denote the ground state corresponding to . We choose a ball such that
[TABLE]
Since vanishes at infinity, and , there exists a constant and a ball also denoted as , such that
[TABLE]
Since the ground state processes corresponding to the principal eigenvalues and are ergodic we have from Lemma 2.7 that
[TABLE]
for all where . By Eq. 2.27, the function satisfies
[TABLE]
Applying the Feynman–Kac formula to Eq. 2.66, using Eq. 2.63, it follows as in [10, Lemma 2.1] that . Thus is bounded away from [math] on .
Let . Then by Eqs. 2.64 and 2.65 we obtain
[TABLE]
Therefore, for some constant we have
[TABLE]
Let . From Eq. 2.63 and exponential Foster–Lyapunov equation Eq. 2.66 we deduce that Eq. 2.54 holds for . Thus the first equation in Eq. 2.62 follows directly from Eqs. 2.54 and 2.67 and the fact that , while the second one follows by repeating the argument leading to Eq. 2.57. This completes the proof. \qed
Remark 2.1
The assumption that Eq. 2.1 is recurrent in the case that in Theorem 2.7 (iii) is equivalent to the statement that . Note that as shown in [18, Theorem 2.1], unless , then Eq. 2.52 does not hold if .
If Eq. 2.1 is not recurrent, then it is possible that and also that for . Consider a diffusion with , , and . Then, we have for . Thus , where denotes the eigenvalue in Eq. 2.5 for . Thus by Lemma 2.2 (b). Since the twisted process corresponding to is exponentially ergodic, we must have by Theorem 2.1 (c), and thus is the ground state. Theorem 2.1 (b) then asserts that is strictly increasing at . Thus . Observe that the ground state diffusion is an Ornstein–Uhlenbeck process having a Gaussian stationary distribution of mean [math] and variance . An easy computation reveals that \mu^{*}\bigl{(}-\langle\nabla\psi^{*},a\nabla\psi^{*}\rangle\bigr{)}=-2 which is smaller than .
The conclusion of Theorem 2.7 (iii) can be sharpened. Consider the controlled diffusion
[TABLE]
Here is a locally bounded Borel measurable map. Let denote the class of such maps. These are identified with the class of locally bounded stationary Markov controls. Let be the collection of those under which the diffusion in Eq. 2.68 is ergodic, and denote by the associated invariant probability measure. We let , and use the symbol to denote the expectation operator associated with Eq. 2.68.
In order to simplify the notation, we use the norm . For we define
[TABLE]
and .
Theorem 2.8
Assume that and . Then the following hold
If , then we have
[TABLE]
In addition, if satisfies , then a.e. 2. 2.
If and Eq. 2.1 is recurrent then Eq. 2.69 holds, and is the a.e. unique control in which satisfies . 3. 3.
If and Eq. 2.1 is not recurrent, then for all .
Proof 15
We start with part (a). By Theorem 2.7 (iii), we have in both of cases (a) and (b). It suffices then to show that if for some , then a.e. in . Let such a control be given. Then Eq. 2.68 must be positive recurrent under , for otherwise we must have \mathscr{J}_{x}(v)\geq\limsup_{T\to\infty}\;\frac{1}{T}\,\widehat{\operatorname{\mathbb{E}}}^{v}_{x}\bigl{[}\int_{0}^{T}-\beta f(Z_{s})\,\mathrm{d}s\bigr{]}\geq 0. Therefore,
[TABLE]
where , as defined earlier, denotes the invariant probability measure associated with . Thus, does not depend on , and dropping this dependence in the notation we let . Since , it follows by Eq. 2.70 and the definition of that there exists a ball such that
[TABLE]
By Eq. 2.71, and since is locally bounded, and is integrable with respect to , we can assert the existence of a solution to the Poisson equation
[TABLE]
which is bounded below in (see Lemma 3.7.8 (d) in [34]). It follows by Eq. 2.72 that , , satisfies
[TABLE]
This shows that \bigl{(}\Phi,-\mathscr{J}(v)\bigr{)} is an eigenpair for , with . The corresponding twisted process with generator then satisfies
[TABLE]
Since is bounded below in and , Eq. 2.74 shows that the twisted process is positive recurrent. We claim that is the principal eigenvalue of . Indeed, if then by the proof of Lemma 2.3 and for any we obtain
[TABLE]
for all sufficiently large , where denotes the first exit time from . By first letting , and then integrating with respect to we obtain
[TABLE]
But this contradicts the positive recurrence of the twisted process corresponding to . Therefore, must be the principal eigenvalue of , which implies that
[TABLE]
Thus we have shown that . The strict monotonicity of at together with Eq. 2.75 imply that a.e. in . In turn, Eq. 2.73 and the uniqueness of the ground state imply that , up to a multiplication by a positive constant. Therefore, we have a.e. in , and this completes the proof of part (a).
We continue with part (b). The case is trivial, so assume that . Then Eq. 2.1 is exponentially ergodic by Theorem 2.7(ii). Thus is bounded away from [math] in by [10, Lemma 2.1]. Let , and . We have
[TABLE]
Since is bounded above in , it follows from Eq. 2.76 by a standard argument that
[TABLE]
We next show uniqueness in of the optimal control . Let and suppose . In other words, . By the Itô–Krylov formula and Fatou’s lemma and since is bounded above, we obtain from Eq. 2.76 that
[TABLE]
with
[TABLE]
Dividing Eq. 2.77 by and taking limits as , we obtain . Therefore, , since is nonnegative. Thus a.e. in , and since has a density, this implies that a.e. in .
We now turn to part (c). It is evident that under the control , since the diffusion in Eq. 2.68 is transient and vanishes at infinity, we have \lim_{t\to\infty}\,\frac{1}{t}\,\widehat{\operatorname{\mathbb{E}}}^{v}_{x}\bigl{[}F_{0}(Z_{t})\bigr{]}=0. It is also clear that under any control we have \lim_{t\to\infty}\,\frac{1}{t}\,\widehat{\operatorname{\mathbb{E}}}^{v}_{x}\bigl{[}F_{v}(Z_{t})\bigr{]}\geq 0. Suppose that under some , we have
[TABLE]
Then there exists a solution to the Poisson equation Eq. 2.72 which is bounded below in . Thus following the proof of Case 1 in part (a) we obtain by Eq. 2.75 that which is a contradiction. We have therefore shown that for all , which implies that [math] is the optimal value in the class of controls . \qed
Remark 2.2
The assumption that is nonnegative can be weakened to . From the proof of Theorem 2.2 we note that if for some , then the ground state diffusion corresponding to is geometrically ergodic. Moreover, due to [5, Proposition 2.3 (vii)] the function is convex for every . Instead of the critical value , we can define a critical value by . Then if we replace the condition by as done in [18], it is evident that is strictly monotone at and the results in Theorem 2.7 (iii) and Theorem 2.8 still hold, provided , and the proofs are the same.
The results in Theorem 2.8 (b) can be also stated for nonstationary controls. Consider the controlled diffusion
[TABLE]
Here is an valued control process which is jointly measurable in , and is nonanticipative: for , is independent of
[TABLE]
Let , not necessarily nonnegative. Assume that , , and Eq. 2.1 is recurrent (see Remark 2.2). Suppose that under , the diffusion in Eq. 2.78 has a unique weak solution. We claim that
[TABLE]
We can prove this as follows. By 14 we obtain
[TABLE]
for all , and we apply the Itô–Krylov formula and Fatou’s lemma (using the fact that is bounded above) with to obtain analogously to Eq. 2.77 that
[TABLE]
Dividing Eq. 2.79 by and letting , we obtain
[TABLE]
thus proving the claim.
2.4.1 Strong duality
The optimality result in Theorem 2.8 can be strengthened. Consider the class of infinitesimal ergodic occupation measures, i.e., measures which satisfy
[TABLE]
with . Disintegrate these as , and denote this disintegration as . Let . Since , and is also an ergodic occupation measure, it is enough to consider the class of infinitesimal ergodic occupation measures that correspond to a precise control , i.e., a Borel measurable map from to . We denote this class by . Thus for , Eq. 2.80 takes the form . Note that is not necessarily locally bounded, so this class of controls is, in general, larger than .
In Theorem 2.9 below we use the following simple assertions which are stated as remarks.
Remark 2.3
If has density , and , then
[TABLE]
This can be proved as follows. We mollify with a smooth mollifier family , so that Eq. 2.80 can applied to the function , where ‘’ denotes convolution. Then we separate terms, and applying the Hölder inequality on \bigl{\lvert}\int(\mathscr{L}g-\mathscr{L}(g*\chi_{r}))\rho_{v}\bigr{\rvert}, and using the convergence of to in , we deduce that this term tends to [math] as . Similarly, we apply the Hölder inequality in the form
[TABLE]
Then the first integral on the right hand side is bounded, and the second integral vanishes as since converges to uniformly on compact sets.
Remark 2.4
Suppose that the drift in Eq. 2.1 has at most affine growth. It is then well known that the map is inf-compact for any open ball , provided of course that Eq. 2.1 is positive recurrent. This fact together with the stochastic representation in Eq. 2.18 and Jensen’s inequality, imply that if , , and Eq. 2.1 is recurrent, then the ground state is inf-compact, and this of course renders Eq. 2.1 positive recurrent. An analogous argument using the ground state diffusion shows that, if has at most affine growth, and (see Remark 2.2), then is inf-compact.
The theorem that follows shows that there is no optimality gap between the primal problem which consists of minimizing subject to the constraint Eq. 2.80, and the dual problem which amounts to a maximization over subsolutions of the HJB equation, as described in Section 1. This theorem is stated for which is not necessarily nonnegative as discussed in Remark 2.2.
Theorem 2.9
Assume that , , and that one of the following conditions holds.
, the coefficients and are bounded, and is uniformly strictly elliptic. 2. 2.
, Eq. 2.1 is recurrent, and has at most affine growth.
Then any , such that , satisfies
[TABLE]
In addition, if is optimal, i.e., if it satisfies , then a.e. in and .
Proof 16
We first consider case (i). Since , , and are bounded, it follows that is bounded by [10, Lemma 3.3]. Then is inf-compact by Remark 2.4. Recall that . We have
[TABLE]
Let be a convex function such that for , for , and , are positive on . Define , . Then we have from Eq. 2.82 that
[TABLE]
Since for all , an application of [41, Theorem 2.1] shows that has a density . Note that this does not require or to be bounded. Therefore, since has compact support, we have by Remark 2.3. Thus letting in Eq. 2.83, using monotone convergence, we obtain Eq. 2.81.
We next show uniqueness. Let be optimal, and denote the ergodic occupation measure corresponding to . Here, . Let denote the density of . Define and , with and given by and , respectively. It is straightforward to verify, using the fact that the drift is affine in the control, that is in .
By optimality, we have
[TABLE]
Since is strictly positive, 16 implies that a.e. in , and thus on the support of . It is clear that if is modified outside the support of , then the modified is also an infinitesimal ergodic occupation measure. Therefore . The uniqueness of the invariant measure of the diffusion with generator then implies that , which in turn implies that a.e. in .
We now turn to case (ii). By Remark 2.4, is inf-compact. Also, as shown in case (i), has a density . We write Eq. 2.76 as
[TABLE]
with . Then we have from Eq. 2.85 that
[TABLE]
Using the inequality , then integrating Eq. 2.86 with respect to , and rearranging terms we obtain
[TABLE]
Thus letting in Eq. 2.87, using monotone convergence, we obtain the energy inequality
[TABLE]
Then Eq. 2.81 follows by letting in Eq. 2.87, using again monotone convergence and Eq. 2.88. Uniqueness follows as in case (i). This completes the proof. \qed
Remark 2.5
The proof of Theorem 2.9 provides a general recipe to prove the lack of an optimality gap in ergodic control problems. Note that the model in [16] is such that is bounded, and is also bounded. Therefore,
[TABLE]
and the proof of Theorem 2.9 goes through even for the more general Hamiltonian in [16].
Remark 2.6
If , and under some nonanticipative control the diffusion Eq. 2.78 has a unique weak solution, it was shown in the discussion following Remark 2.2 that , provided and Eq. 2.1 is recurrent. The same conclusion can be drawn if and under the hypotheses of Theorem 2.9. Define the set of mean empirical measures \bigl{\{}\xi^{U}_{x,t}\,,\;t\geq 0\} of Eq. 2.78 under the control by
[TABLE]
If , then is bounded away from zero for all outside some compact set, and one can follow the arguments in the proof of [34, Lemma 3.4.6] to show that every limit point in (the set of Borel probability measures on the one-point compactification of ) of a sequence of mean empirical measures as takes the form , where is an infinitesimal ergodic occupation measure and . Using this property, one can show, by following the argument in the proof of [34, Theorem 3.4.7], that if , then the mean empirical measures are necessarily tight in and in this decomposition. This of course implies that . This argument establishes optimality over the largest possible class of controls .
2.4.2 Differentiability of
Differentiability of the map for all is established in [18, Proposition 5.4] under the hypothesis that the coefficients , , and are Lipschitz continuous and bounded in , but for a more general class of Hamiltonians (see (A1)–(A3) in [18]). These assumptions are used to show that is bounded in , and this is utilized in the proofs.
In the next theorem we demonstrate this differentiability result for the model in this paper which assumes only measurable and , in which case it is not possible, in general, to obtain gradient estimates and follow the approach in [16, 18, 19]. The first assertion in this theorem should be compared to [18, Proposition 5.4]. Recall the definition after Eq. 2.65, and let .
Theorem 2.10
Suppose , and that . Then for all such that , we have
[TABLE]
In addition, we have
[TABLE]
Proof 17
Fix some such that , and consider Eq. 2.66. As argued in the proof of Theorem 2.7, the function is bounded away from [math] on for all . We recall the notation . Applying the Itô–Krylov formula and Fatou’s lemma to Eq. 2.66 we obtain
[TABLE]
from which the left hand side inequality of Eq. 2.89 follows by an application of Birkhoff’s ergodic theorem. Also the analogous estimate to Eq. 2.54 holds for , which implies that
[TABLE]
The second equality in Eq. 2.89 then follows by first using the technique in the proof of Theorem 2.7 and Eq. 2.91 to establish Eq. 2.56 for , , and then applying the Itô–Krylov formula to the log-transformed equation corresponding to Eq. 2.66 as in Eq. 2.60, and taking limits at .
Using the convexity of , we write Eq. 2.89 as
[TABLE]
Fix an open ball , such that
[TABLE]
This is clearly possible since is nonincreasing, , and vanishes at infinity. Let . Since the ground state process corresponding to is exponentially ergodic for by Theorem 2.7, we have
[TABLE]
by Lemma 2.7. Since and its inverse are bounded on , uniformly in , it follows from Eqs. 2.93 and 2.94 that there exists such that for all . Therefore, since the collection \bigl{\{}\Psi^{*}_{\beta-\epsilon}\,,\,\epsilon\in[-\epsilon_{1},\epsilon_{1}]\bigr{\}}, is bounded in , , we can use Eq. 2.94 and the dominated convergence theorem to conclude that as . Thus, one more application of the dominated convergence theorem shows that and as . This shows that
[TABLE]
We next study the term . Let denote the expectation operator for the ground state diffusion corresponding to . Since
[TABLE]
it follows by an estimate similar to Eq. 2.93 that for all (see also Theorem 3.1 in Section 3).
We claim that
[TABLE]
Indeed, let be a larger ball such that . It suffices to exhibit the result for . For some positive constants , , we have
[TABLE]
and also (see [34, Theorem 2.6.1])
[TABLE]
We use the inequality , which follows from the well-known characterization of invariant probability measures due to Hasminskiĭ [34, Theorem 2.6.9], and which establishes the claim.
It follows from Eq. 2.96 that the corresponding densities are locally bounded and also bounded away from [math] uniformly in by the Harnack inequality (see proof of equation (3.2.6) in [34]). Therefore, standard pde estimates of the Fokker–Planck equation show that this family of densities is locally Hölder equicontinuous [35, Theorem 8.24, p. 202]. Given any we may enlarge so that and on . Let be the (uniform) limit of on along some subsequence . Since is Hölder equicontinuous on , uniformly in as argued earlier, it follows that is strictly positive on . It is straightforward to show then that is a positive solution of the Fokker–Planck equation for the (adjoint of the) operator . By the uniqueness of the invariant probability measure we have for some positive constant . Since , we have . Thus, since , and on , by Fatou’s lemma we obtain
[TABLE]
Since can be selected arbitrarily close to [math], we obtain from Eq. 2.92 that . Combining this with Eqs. 2.94 and 2.95 we obtain Eq. 2.90. \qed
3 Exponential ergodicity and strict monotonicity of principal eigenvalues
In this section we show that exponential ergodicity of Eq. 2.1 is a sufficient condition for the strict monotonicity of the principal eigenvalue. In [17, 9] exponential ergodicity is used to obtain results similar to Theorem 2.1. In these studies the coefficients , , and are assumed to be , and this assumption seems hard to waive as the technique used relies on a gradient estimate (see [17, Theorem 3.1] and [9, Lemma 2.4]) which is not available for less regular coefficients. Our approach has allowed us to obtain the results in Section 2 under much weaker hypotheses on the coefficients. Under some additional hypotheses, we show in this section that . Recall the definition of in Eq. 2.46. It is straightforward to show that . We present an example where , and therefore also .
Example 3.1
Let be a smooth function which is strictly positive on and satisfies for . Define
[TABLE]
Then for , and
[TABLE]
Consider the one-dimensional controlled diffusion
[TABLE]
From Eq. 3.1 and Lemma 2.2 (ii) we have . It is clear that Eq. 3.2 is a transient process. Therefore, for any initial data
[TABLE]
Hence .
Remark 3.1
Example 3.1* presents a case where the conclusion of [5, Theorem 1.9] fails to hold. Since the operator in this example is uniformly elliptic, has bounded coefficients, and , the only aspect that makes it different from the class of operators in part (i) of [5, Theorem 1.9], is that it is not self-adjoint.*
Let us start by summarizing some equivalent characterizations of exponential ergodicity.
Theorem 3.1
The following are equivalent.
For some ball there exists and such that . 2. 2.
For every ball there exists such that for all . 3. 3.
For every ball , there exists a positive function , with , and positive constants and such that
[TABLE] 4. 4.
Equation 2.1* is recurrent, and for every ball .*
Proof 18
We first show that (a)(d). It is clear that (a) implies that Eq. 2.1 is positive recurrent, and that it is enough to prove that for any . Let , and consider the Dirichlet eigensolutions in Eq. 2.3. It is easy to see that for all . We claim that . If not, then as , and converges to some , , which satisfies on and . The same argument used in the proof of Lemma 2.2 then shows that \Psi(x)=\operatorname{\mathbb{E}}_{x}\bigl{[}\Psi(X_{\uptau(\mathscr{B}^{c}_{\circ})}\bigr{]}. Therefore, attains a maximum on , and by the strong maximum principle it must be constant. Thus which contradicts the fact that .
Next we show that (d)(c). If , for , then any limit point of the Dirichlet eigenfunctions as satisfies
[TABLE]
Also by [10, Lemma 2.1 (c)], we have . Thus (c) holds with .
That (c)(b) is well known, and can be shown by a standard application of the Itô–Krylov formula to Eq. 3.3, by which we obtain
[TABLE]
The result then follows by letting , and this completes the proof. \qed
We introduce the following hypothesis.
- (H2)
There exists a lower-semicontinuous, inf-compact function such that , where is as defined in Eq. 1.3.
Lemma 3.1
Under (H2), we have , and there exists a positive , , with , and , satisfying
[TABLE]
In particular, the unique strong solution of Eq. 2.1 is exponentially ergodic.
Proof 19
By Eq. 2.4 we have
[TABLE]
Since , Eq. 3.5 implies that
[TABLE]
for some . The inf-compactness of then implies that the unique strong solution of Eq. 2.1 is positive recurrent. That , and the existence of a solution then follow by Theorem 1.4 and Lemma 2.1 in [10], respectively. Exponential ergodicity then follows from Eq. 3.4, using Theorem 3.1. \qed
An application of the Itô–Krylov formula to Eq. 3.4, followed by Fatou’s lemma, shows that
[TABLE]
where , as defined earlier, denotes the first hitting time of the ball .
The next result shows that (H2) implies (P1).
Theorem 3.2
Assume (H2), and suppose that is a potential such that is inf-compact. Then for any continuous we have
[TABLE]
Proof 20
Let , and . It is easy to see that and are both finite. It is shown in [10, 28] that the Dirichlet eigensolutions in Eq. 2.3 converge, along some subsequence as , to \bigl{(}\Psi^{*},\lambda^{\!*}(f)\bigr{)} which satisfies
[TABLE]
It is also clear that Lemma 2.2 (i) holds for . Now choose a bounded ball such that
[TABLE]
This is possible since is inf-compact. In view of Eq. 3.6 we note that Eq. 2.12 holds with replaced by . Thus with the above choice of , we can justify the passing to the limit in Eq. 2.14, and therefore, we obtain
[TABLE]
with . Recall the definition of \bigl{(}\tilde{\Psi}^{*},\lambda^{\!*}(\tilde{f})\bigr{)} in Eq. 2.31. A similar argument also gives
[TABLE]
In fact, the above relations hold for any bounded domain with .
Suppose that . Then
[TABLE]
Thus if we multiply with a suitable positive constant such that is nonnegative in and attains a minimum of [math] in , it follows from Eqs. 3.8 and 3.9 that is nonnegative in . Since Eq. 2.34 holds, and we conclude exactly as in the proof of Theorem 2.2 that .
Next we show that for all . We have already established the strict monotonicity of at , and therefore, Theorem 2.1 applies. Hence for any continuous with compact support we have from [40, Theorem 1.3.10] that
[TABLE]
where denotes the invariant measure of the twisted process satisfying Eq. 2.25. Let be a ball such that for . Thus from Eq. 3.4 we obtain
[TABLE]
with \kappa=\max_{\tilde{\mathscr{B}}}\bigl{(}\lvert f\rvert+\lvert\ell\rvert+\lvert\lambda^{\!*}\rvert+\lambda^{\!*}(\ell)\bigr{)}\,V. Applying the Itô–Krylov formula to Eq. 3.11 followed by Fatou’s lemma we obtain
[TABLE]
for some constant , where in the last inequality we have used Eq. 3.10. Taking logarithms on both sides of the preceding inequality, then dividing by , and letting , we obtain for all . Combining this with Eq. 3.7 results in equality. \qed
Remark 3.2
Continuity of is superfluous in Theorem 3.2. The result holds if is a non-trivial, nonnegative measurable function, vanishing at infinity.
Corollary 3.1
Under the assumptions of Theorem 3.2, for any potential , we have .
Proof 21
Note that for any cut-off function we have . Then the result follows from Theorems 3.2 and 3.2. \qed
Remark 3.3
In Theorem 3.2 we can replace the assumption that is bounded from below in by the hypothesis that is inf-compact.
Let us now discuss the exponential ergodicity and show that this implies (H2).
Proposition 3.1
Let be inf-compact, and suppose is bounded below in and satisfies
[TABLE]
then for all .
Proof 22
Let . Then , and Eq. 3.12 gives
[TABLE]
Now apply the Itô–Krylov formula to Eq. 3.13 followed by Fatou’s lemma to obtain
[TABLE]
Taking logarithm on both sides, diving by and letting , we obtain . \qed
Example 3.2
Let and where is bounded and
[TABLE]
Then we take for . It is easy to check that for a suitable choice of , Eq. 3.12 holds for .
Remark 3.4
Equation 3.12* is a stronger condition than strict monotonicity of at . In fact, Eq. 3.12 might not hold in many important situations. For instance, if and are both bounded, and is uniformly elliptic, then it is not possible to find inf-compact satisfying Eq. 3.12. Otherwise, we can find a finite principal eigenvalue for the operator , by a same method as in Eq. 3.7, which would contradict [5, Proposition 2.6].*
Even though Eq. 3.12 does not hold for bounded and , strict monotonicity of at can be asserted under suitable hypotheses. This is the subject of the following theorem.
Theorem 3.3
Let such that , satisfying
[TABLE]
for some compact set and positive constants and . Let be a nonnegative bounded measurable function with . Then for any , we have for all .
Proof 23
Let . Suppose . Applying an argument similar to Eq. 3.7 we can find and that satisfy
[TABLE]
Let be any compact set such that on . If denotes the first hitting time to the compact set , then by an application of the Itô–Krylov formula to Eq. 3.14 we obtain
[TABLE]
We next use the fact that if corresponds to a recurrent diffusion and is nonnegative then . Indeed, in this case we have . If , this implies that is a nonnegative supermartingale and since it is integrable, it converges a.s. Since the process is recurrent, this implies that must equal to a constant, which, in turn, necessitates that (and ). Thus, since , an argument similar to the proof of Lemma 2.2 (ii) shows that
[TABLE]
for . Therefore, applying the strong maximum principle as in Theorem 3.2, we obtain which is a contradiction since and . Thus we have . That for all follows by an argument similar to the one used in the proof Theorem 3.2. \qed
Example 3.3
Suppose , where denotes the identity matrix, and
[TABLE]
With for , we have
[TABLE]
4 Risk-sensitive control
In this section we apply the results developed in the previous sections to the risk-sensitive control problem. As mentioned earlier, we establish the existence and uniqueness of solutions to the risk-sensitive HJB equation, and use this to completely characterize the optimal Markov controls (see Theorems 4.1 and 4.2). Another interesting result is the continuity of the controlled principal eigenvalue with respect to the stationary Markov controls. This is done in Theorem 4.3. We first introduce the control problem.
4.1 The controlled diffusion model
Consider a controlled diffusion process which takes values in the -dimensional Euclidean space , and is governed by the Itô equation
[TABLE]
All random processes in Eq. 4.1 live in a complete probability space . The process is a -dimensional standard Wiener process independent of the initial condition . The control process takes values in a compact, metrizable set , and is jointly measurable in . The set of admissible controls consists of the control processes that are non-anticipative: for , is independent of
[TABLE]
We impose the following standard assumptions on the drift and the diffusion matrix to guarantee existence and uniqueness of solutions.
- (B1)
Local Lipschitz continuity: The functions and are continuous, and satisfy
[TABLE]
for some constant depending on .
- (B2)
Affine growth condition: For some , we have
[TABLE]
- (B3)
Nondegeneracy: Assumption (A3) in Subsection 1.1 holds.
It is well known that under (B1)–(B3), for any admissible control there exists a unique solution of Eq. 4.1 [34, Theorem 2.2.4]. We define the family of operators , where plays the role of a parameter, by
[TABLE]
The risk-sensitive criterion
Let denote the class of functions in that are locally Lipschitz in uniformly with respect to . We let denote the running cost function, and for any admissible control , we define the risk-sensitive objective function by
[TABLE]
We also define .
4.2 Relaxed controls
We adopt the well-known relaxed control framework [34]. According to this relaxation, a stationary Markov control is a measurable map from to , the latter denoting the set of probability measures on under the Prokhorov topology. Let denote the class of all such stationary Markov controls. A control may be viewed as a kernel on , which we write as . We say that a control is precise if it is a measurable map from to . We extend the definition of and as follows. For we let
[TABLE]
It is easy to see from (B2) and Jensen’s inequality that
[TABLE]
For , consider the relaxed diffusion
[TABLE]
It is well known that under Eq. 4.3 has a unique strong solution [42], which is also a strong Markov process. It also follows from the work in [41] that under , the transition probabilities of have densities which are locally Hölder continuous. Thus defined by
[TABLE]
for , is the generator of a strongly-continuous semigroup on , which is strong Feller. We let denote the probability measure and the expectation operator on the canonical space of the process under the control , conditioned on the process starting from at . We denote by the subset of that consists of stable controls, i.e., under which the controlled process is positive recurrent, and by the invariant probability measure of the process under the control .
Definition 4.1
For and a locally bounded measurable function , we let denote the principal eigenvalue of the operator on (see Definition 2.1).
We also adapt the notation in Eq. 2.4 to the control setting, and define
[TABLE]
We refer to as the risk-sensitive average of under the control .
Recall the risk-sensitive objective function defined in Eq. 4.2 and the optimal value . We say that a stationary Markov control is optimal (for the risk-sensitive criterion) if for all , and we let denote the class of these controls.
4.3 Optimal Markov controls and the risk-sensitive HJB
We start with the following assumption.
Assumption 4.1** (uniform exponential ergodicity)**
There exists an inf-compact function and a positive function , satisfying , such that
[TABLE]
for some constant , and a compact set .
It is easy to see that for we obtain from Eq. 4.4 that
[TABLE]
and therefore, applying the Itô–Krylov formula, we have for any stationary Markov control , and all .
Example 4.1
Let be bounded and be such that
[TABLE]
Then as seen in Example 3.2, , for , satisfies Eq. 4.4 for sufficiently small , and . Note that and is considered in [23].
We introduce the class of running costs defined by
[TABLE]
The first important result of this section is the following.
Theorem 4.1
Suppose Assumption 4.1 holds, and . Then does not depend on , and there exists a positive solution satisfying
[TABLE]
In addition, if denotes the class of Markov controls which satisfy
[TABLE]
then the following hold.
, and it holds that for all ; 2. 2.
; 3. 3.
Equation 4.5* has a unique positive solution in (up to a multiplicative constant).*
Proof 24
Using a standard argument (see [26, 28, 10]) we can find a pair , with on , and , that satisfies
[TABLE]
This is obtained as a limit of Dirichlet eigensolutions (\widehat{V}_{n},\hat{\lambda}_{n})\in\bigl{(}\mathscr{W}_{\mathrm{loc}}^{2,p}(B_{n})\cap C(\bar{B}_{n})\bigr{)}\times\mathbb{R}, for any , satisfying on , on , , and
[TABLE]
For we have
[TABLE]
By Corollary 2.1 we obtain . Also by Theorem 3.2 we have for all . Combining these estimates with Eq. 4.6 we obtain
[TABLE]
This of course shows that for all , and also proves part (a).
We continue with part (b). By Theorem 3.2 we have
[TABLE]
In turn, by Lemma 2.4 there exists a unique eigenfunction which is associated with the principal eigenvalue of the operator . Since for all by part (a), it follows by Eq. 4.7 that
[TABLE]
By Eq. 4.8 and Lemma 2.2 (ii), and since Eq. 4.3 is recurrent, we have
[TABLE]
and all sufficiently large balls centered at [math], where , as usual.
Since the Dirichlet eigenvalues satisfy for all , the Dirichlet problem
[TABLE]
with , has a unique solution , for any [4, Theorem 1.9] (see also [43, Theorem 1.1 (ii)]). We choose as follows: first select such that the solution of Eq. 4.11 with satisfies , and then set . Passing to the limit in Eq. 4.11 as along a subsequence, we obtain a nonnegative solution of
[TABLE]
It is evident from the construction that if then . On the other hand, if , then necessarily is positive on . Let be a selector from the minimizer of Eq. 4.12. If , then Eq. 4.12 implies that there exists such that . Since for all by Theorem 3.2, and , then, in view of Corollary 2.1, this contradicts Eq. 4.8 and the convexity of . Therefore, we must have . Let . Applying the Itô–Krylov formula to Eq. 4.11 we obtain
[TABLE]
and for all , where . Using the argument in the proof of [10, Lemma 2.11], we obtain
[TABLE]
Comparing Eq. 4.10 and Eq. 4.13, it follows that, given any , we can scale by a positive constant so that it touches from above at some point in . However, satisfies
[TABLE]
by Eq. 4.12. Thus we have
[TABLE]
and it follows by the strong maximum principle that for all . Since by part (a), it then follows by Eq. 4.9 that for all . Thus we have
[TABLE]
This proves the verification of optimality result in part (b).
Suppose now that is a positive solution of
[TABLE]
Let be a selector from the minimizer of Eq. 4.14. We have for all by Theorem 3.2 and the definition of , and by Corollary 2.1. Thus for all , which implies that . Then by the uniqueness of the latter. Therefore, by part (b). This completes the proof. \qed
As mentioned in Remark 3.4 the existence of an inf-compact in Assumption 4.1 is not possible when and are bounded. So we consider the following alternative assumption.
Assumption 4.2
There exists a function , such that , a compact set , and positive constants and , satisfying
[TABLE]
A similar assumption is used in [26] where the author has obtained only the existence of the solution to the HJB, and an optimal control. Also it is shown in [26] that there exists a constant , depending on , such that if , then Eq. 4.15 below has a solution. We improve these results substantially by proving uniqueness of the solution , and verification of optimality.
Theorem 4.2
Under Assumption 4.2, there exists a positive solution satisfying
[TABLE]
Let be as in Theorem 4.1. Then (a) and (b) of Theorem 4.1 hold, and Eq. 4.15 has a unique positive solution in up to a multiplicative constant.
Proof 25
Part (a) follows exactly as in the proof of Theorem 4.1.
By Theorems 3.3 and 2.7 for any there exists a unique eigenpair for . In addition,
[TABLE]
The rest follows as in Theorem 4.1. \qed
4.4 Continuity results
It is known from [34] that the set of relaxed stationary Markov controls is compactly metrizable (see also [44] for a detailed construction of this topology). In particular in if and only if
[TABLE]
for all and . For we denote by the principal eigenpair of the operator , i.e.,
[TABLE]
When , we occasionally drop the dependence on and denote the eigenvalue as . The next result concerns the continuity of with respect to stationary Markov controls, and extends the result in [5, Proposition 9.2]. The continuity result in [5, Proposition 9.2] is established with respect to the norm convergence of the coefficients, whereas Theorem 4.3 that follows asserts continuity under a much weaker topology.
Theorem 4.3
Assume one of the following.
Assumption 4.1* holds, and for some .* 2. 2.
Assumption 4.2* holds.*
Then the map is continuous.
Proof 26
We demonstrate the result under (i). For case (ii) the proof is analogous. Let in the topology of Markov controls. Let be the principal eigenpair which satisfies
[TABLE]
where the equality is a consequence of Theorems 3.2 and 3.1. It is obvious that for all .
Since is inf-compact, we can find a constant such that . Recall that for all (as shown in the paragraph after Assumption 4.1), and this implies that for all . Thus is bounded. Therefore, passing to a subsequence we may assume that as . To complete the proof we only need to show that . Since for all , and the coefficients , and are uniformly locally bounded, applying Harnack’s inequality and Sobolev’s estimate we can find , , such that weakly in . Therefore, by [34, Lemma 2.4.3] and Eq. 4.16, we obtain
[TABLE]
By Corollary 2.1 we have .
Let be an open ball such that for all , and be large enough so that . Let . Applying the Itô–Krylov formula to Eq. 4.17, we obtain
[TABLE]
for any . Since
[TABLE]
and in bounded in , for every fixed , letting in Eq. 4.18 we have
[TABLE]
26* which also holds, possibly for a larger ball , if we replace and with and , respectively, shows that, for some constant , we have \Psi_{n}(x)\leq\tilde{\kappa}\bigl{(}\mathscr{V}(x)\bigr{)}^{\beta} for all , and . Therefore, \Psi(x)\leq\tilde{\kappa}\bigl{(}\mathscr{V}(x)\bigr{)}^{\beta} for all .*
We write
[TABLE]
The left hand side of Eq. 4.21 and the first term on the right hand side both converge to \operatorname{\mathbb{E}}_{x}^{v}\bigl{[}\mathrm{e}^{\int_{0}^{\breve{\uptau}}\ell(X_{s})\,\mathrm{d}{s}}\bigr{]} as , by monotone convergence. Thus we have
[TABLE]
On the other hand Assumption 4.1 implies that
[TABLE]
We proceed as in the proof of Theorem 2.7. Let for . Since on , we have \mathscr{V}^{\beta-1}\leq\bigl{(}\tfrac{\Psi}{\tilde{\kappa}}\bigr{)}^{1-\frac{1}{\beta}} on , and, therefore,
[TABLE]
Thus, using Eq. 4.23, we obtain
[TABLE]
and by first letting , using Eq. 4.22, and then , it follows that the left hand side of 26 vanishes as . Therefore, letting in Eq. 4.20, we obtain
[TABLE]
It then follows by Corollary 2.3 that , and this completes the proof. \qed
Remark 4.1
Following the proof of Theorem 4.3 we can obtain the following continuity result which should be compared with [5, Proposition 9.2 (ii)]. Consider a sequence of operators with coefficients , where , are locally bounded uniformly in , and . The coefficients and are assumed to satisfy (A1)–(A3) uniformly in . Assume that in , and and weakly in . Moreover we suppose that one of the following hold.
There exists an inf-compact function and , with , such that a.e. on for some constant , and a compact set . In addition, is inf-compact for some . 2. 2.
The sequence satisfies Eq. 3.14 for all , , and
[TABLE]
Then the principal eigenvalue converges to as .
As an application of Theorem 4.3 we have the following existence result for the risk-sensitive control problem under (Markovian) risk-sensitive type constraints.
Theorem 4.4
Assume one of the following.
Assumption 4.1* holds, and for some .* 2. 2.
Assumption 4.2* holds, and satisfy*
[TABLE]
In addition, suppose that , , are closed subsets of , and that there exists such that for all , where we use the usual notation r_{i,v}(x)\coloneqq r_{i}\bigl{(}x,v(x)\bigr{)}.
Then the following constrained minimization problem admits an optimal control in
[TABLE]
Proof 27
Let be a sequence of controls along which the constraints are met, and converges to its infimum. Since is compact under the topology of Markov controls, we may assume, without loss of generality, that converges to some as . By Theorem 4.3 we know that , and , , are continuous maps, and that , and for . It follows that the constraints are met at . Therefore, is an optimal Markov control for the constrained problem. \qed
Another application of Theorem 4.3 is a following characterization of which provides a positive answer to [5, Conjecture 1.8] for a certain class of and . In Theorem 4.5 below, we consider the uncontrolled generator in Section 3. Let us introduce the following definition from [5]
[TABLE]
Recall the definition of in Eq. 2.46. From [5, Theorem 1.7], under (A1)–(A2), we have whenever is bounded above. It is conjectured in [5, Conjecture 1.8] that for bounded , , and , one has . It should be noted from Example 3.1 that could be strictly smaller than . The following result complements those in [5, Theorems 1.7 and 1.9].
Theorem 4.5
For a potential the following are true.
Suppose that . Then under (A1)–(A3) we have
[TABLE] 2. 2.
Let , and satisfy Eq. 3.14, and suppose that . Then . 3. 3.
Let , and satisfy Eq. 4.4, and suppose that is inf-compact for some . Then .
Proof 28
We first show (i). By [5, Theorem 1.7 (ii)] we have . Let , , be such that
[TABLE]
Recall that is the exit time from the open ball . Therefore, applying the Itô–Krylov formula, we obtain
[TABLE]
Since is finite, letting in Eq. 4.25, taking logarithms on both sides, dividing by and then letting we obtain . This implies . Now suppose , with , satisfies
[TABLE]
Repeating the analogous calculation as above, we obtain , which implies that .
Next we prove (ii). Since for any constant , we may replace by . Therefore, is non-negative and . By (i) we have . Let be a cut-off function such that for , and for . Define . Let \bigl{(}\Psi^{*}_{n},\lambda^{\!*}(f_{n})\bigr{)} denote the principal eigenpair of . By Remark 4.1 we have as . Thus to complete the proof it is enough to show that , which implies that for all , and thus . Note that for all . Now fix and let be the first hitting time to the ball . Then applying the Itô–Krylov formula to
[TABLE]
together with Fatou’s lemma, we have
[TABLE]
for all . Hence which completes the proof.
The proof of (iii) is completely analogous to the proof of part (ii). Since is inf-compact, we can find , such that , and is inf-compact. We let . Note that
[TABLE]
is inf-compact. On the other hand, for all . The rest follows as part (ii). \qed
Acknowledgements
The research of Ari Arapostathis was supported in part by the Army Research Office through grant W911NF-17-1-001, in part by the National Science Foundation through grant DMS-1715210, and in part by the Office of Naval Research through grant N00014-16-1-2956. The research of Anup Biswas was supported in part by an INSPIRE faculty fellowship, and a DST-SERB grant EMR/2016/004810.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. G. Kreĭn, M. A. Rutman, Linear operators leaving invariant a cone in a Banach space, Amer. Math. Soc. Translation 1950 (26) (1950) 128.
- 2[2] R. G. Pinsky, Positive harmonic functions and diffusion, Vol. 45 of Cambridge Studies in Advanced Mathematics, Cambridge University Press, Cambridge, 1995.
- 3[3] H. Berestycki, L. Nirenberg, S. R. S. Varadhan, The principal eigenvalue and maximum principle for second-order elliptic operators in general domains, Comm. Pure Appl. Math. 47 (1) (1994) 47–92. doi:10.1002/cpa.3160470105 . · doi ↗
- 4[4] A. Quaas, B. Sirakov, Principal eigenvalues and the Dirichlet problem for fully nonlinear elliptic operators, Adv. Math. 218 (1) (2008) 105–135. doi:10.1016/j.aim.2007.12.002 . · doi ↗
- 5[5] H. Berestycki, L. Rossi, Generalizations and properties of the principal eigenvalue of elliptic operators in unbounded domains, Comm. Pure Appl. Math. 68 (6) (2015) 1014–1065. doi:10.1002/cpa.21536 . · doi ↗
- 6[6] Y. Furusho, Y. Ogura, On the existence of bounded positive solutions of semilinear elliptic equations in exterior domains, Duke Math. J. 48 (3) (1981) 497–521. doi:10.1215/S 0012-7094-81-04828-6 . · doi ↗
- 7[7] Y. Pinchover, On positive solutions of second-order elliptic equations, stability results, and classification, Duke Math. J. 57 (3) (1988) 955–980. doi:10.1215/S 0012-7094-88-05743-2 . · doi ↗
- 8[8] H. Berestycki, L. Rossi, On the principal eigenvalue of elliptic operators in ℝ N superscript ℝ 𝑁 \mathbb{R}^{N} and applications, J. Eur. Math. Soc. (JEMS) 8 (2) (2006) 195–215. doi:10.4171/JEMS/47 . · doi ↗
