A variational characterization of the risk-sensitive average reward for controlled diffusions on $\mathbb{R}^d$
Ari Arapostathis, Anup Biswas, Vivek S. Borkar, K. Suresh Kumar

TL;DR
This paper develops a variational framework for the risk-sensitive reward problem in controlled diffusions on bd, linking it to eigenvalues of associated operators and extending results to unbounded drifts and costs.
Contribution
It introduces a variational formula for the risk-sensitive value and connects it to the principal eigenvalue of a semilinear operator, extending previous results.
Findings
Established a variational formula on bd for the risk-sensitive reward.
Showed the risk-sensitive value equals the generalized principal eigenvalue.
Extended results to unbounded drifts and costs using a new gradient estimate.
Abstract
We address the variational formulation of the risk-sensitive reward problem for non-degenerate diffusions on controlled through the drift. We establish a variational formula on the whole space and also show that the risk-sensitive value equals the generalized principal eigenvalue of the semilinear operator. This can be viewed as a controlled version of the variational formulas for principal eigenvalues of diffusion operators arising in large deviations. We also revisit the average risk-sensitive minimization problem and by employing a gradient estimate developed in this paper, we extend earlier results to unbounded drifts and running costs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\newsiamremark
remarkRemark \newsiamremarkassumptionAssumption \newsiamremarknotationNotation \newsiamremarkexampleExample
\headersA variational formula for risk-sensitive controlA. Arapostathis, A. Biswas, V.S. Borkar, and K. Suresh Kumar
A variational characterization of the risk-sensitive
average reward for controlled diffusions on .
Ari Arapostathis Department of Electrical and Computer Engineering, The University of Texas at Austin, EER 7.824, Austin, TX 78712 (). [email protected]
Anup Biswas Department of Mathematics, Indian Institute of Science Education and Research, Dr. Homi Bhabha Road, Pune 411008, India (). [email protected]
Vivek S. Borkar Department of Electrical Engineering, Indian Institute of Technology, Powai, Mumbai 400076, India (). [email protected]
K. Suresh Kumar Department of Mathematics, Indian Institute of Technology, Powai, Mumbai 400076, India (). [email protected]
Abstract
We address the variational formulation of the risk-sensitive reward problem for non-degenerate diffusions on controlled through the drift. We establish a variational formula on the whole space and also show that the risk-sensitive value equals the generalized principal eigenvalue of the semilinear operator. This can be viewed as a controlled version of the variational formulas for principal eigenvalues of diffusion operators arising in large deviations. We also revisit the average risk-sensitive minimization problem and by employing a gradient estimate developed in this paper we extend earlier results to unbounded drifts and running costs.
keywords:
principal eigenvalue, Donsker–Varadhan functional, risk-sensitive criterion
{AMS}
60J60, Secondary 60J25, 35K59, 35P15, 60F10
1 Introduction
In this paper we consider the risk-sensitive reward maximization problem on for diffusions controlled through the drift. The main objective is to derive a variational formulation for the risk-sensitive reward in the spirit of [2], which does so for discrete time problems on a compact state space, and analyze the associated Hamilton–Jacobi–Bellman (HJB) equation. Since the seminal work of Donsker and Varadhan [18, 19], this problem has acquired prominence. The variational formula derived here can be viewed as a controlled version of the variational formulas for principal eigenvalues of diffusion operators arising in large deviations. For reversible diffusions, this formula can be viewed as an abstract Courant–Fischer formula [18]. For general diffusions, the correct counterpart in linear algebra is the Collatz–Wielandt formula for the principal eigenvalue of non-negative matrices [27, Chapter 8]. For its connection with the large deviations theory for finite Markov chains and an equivalent variational description, see [17].
There has been considerable interest to generalize this theory to a natural class of nonlinear self-maps on positive cones of finite or infinite dimensional spaces. The first task is to establish the existence and where possible, uniqueness of the principal eigenvalue and eigenvector (the latter modulo a scalar multiple as usual), that is, a nonlinear variant of the Perron–Frobenius theorem in the finite dimensional case and its generalization, the Krein–Rutman theorem, in Banach spaces. This theory is carried out in, e.g., [25, 29]. The next problem is to derive an abstract Collatz–Wielandt formula for the principal eigenvalue [1]. In bounded domains, a Collatz–Wielandt formula for the Dirichlet principal eigenvalue of a convex nonlinear operator is obtained in [10]. Our first objective coincides with this, albeit for Feynman–Kac operators arising in risk-sensitive control that we introduce later. For risk-sensitive reward processes, that is, the problem of maximizing the asymptotic growth rate for the risk-sensitive reward in discrete time problems, one can go a step further and give an explicit characterization of the principal eigenvalue as the solution of a concave maximization problem [2]. The objective of this article is to carry out this program for controlled diffusions.
At this juncture, it is worthwhile to underscore the difference between reward maximization and cost minimization problems with risk-sensitive criteria. Unlike the more classical criteria such as ergodic or discounted, they cannot be converted from one to the other by a sign flip. The cost minimization criterion, after a logarithmic transformation applied to its HJB equation, leads to the Isaacs equation for a zero-sum stochastic differential game [20]. An identical procedure applied to the reward maximization problem would lead to a team problem wherein the two agents seek to maximize the same payoff non-cooperatively. The latter in particular implies that their decisions at any time are conditionally independent given the state (more generally, the past history). Our approach leads to a concave maximization problem, an immense improvement with potential implications for possible numerical schemes. This does not seem possible for the cost minimization problem. Thus the complexity of the latter is much higher. Recently, a risk-sensitive maximization problem is also studied in [14] under a blanket geometric stability condition. In the present paper we do not impose any blanket stability on the controlled processes.
We first establish these results for reflected diffusions in a bounded domain, for which the nonlinear Krein–Rutman theorem of [29] paves the way. This is not so if the state space is all of . Extension to the whole space turns out to be quite involved due to the lack of compactness. Even the well-posedness of the underlying nonlinear eigenvalue problem is pretty tricky. Hence we proceed via the infinite volume limit of the finite volume problems. This leads to an abstract Collatz–Wielandt formula and an abstract Donsker–Varadhan formula. More specifically, in Theorem 3.4 we show that the generalized eigenvalue of the semilinear operator is simple, and identify some useful properties of its eigenvector. We proceed to prove equality between the risk-sensitive value and the generalized principal eigenvalue in Theorem 3.8, which also establishes a verification of optimality criterion. The general result for the variational formula is in Proposition 4.1, followed by more specialized results in Theorems 4.11 and 4.15. In the process of deriving these results, we present some techniques that may have wider applicability. Most prominent of these is perhaps the gradient estimate in Lemma 4.5 for operators with measurable coefficients.
Lastly, in Section 5 we revisit the risk-sensitive minimization problem, and with the aid of Lemma 4.5 we improve the main result in [3] by extending it to unbounded drifts and running costs, under suitable growth conditions (see Section 5).
1.1 A brief summary of the main results
We summarize here the results concerning the variational formula on the whole space. We consider a controlled diffusion in of the form
[TABLE]
defined in a complete probability space . The process is a -dimensional standard Wiener process independent of the initial condition , and the control process lives in a compact metrizable space . We impose a standard set of assumptions on the coefficients which guarantee existence and uniqueness of strong solutions under all admissible controls. Namely, local Lipschitz continuity in and at most affine growth of and , and local non-degeneracy of (see Section 3 (i)). But we do not impose any ergodicity assumptions on the controlled diffusion. The process could be transient.
We let be a continuous running reward function, which is assumed bounded from above, and define the optimal risk-sensitive value by
[TABLE]
where the supremum is over all admissible controls, and denotes the expectation operator. This problem is translated to an ergodic control problem for the operator , defined by
[TABLE]
where denotes the Hessian, and , that seeks to maximize the average value of the functional
[TABLE]
We first show that the generalized principal eigenvalue (see Eq. 37) of the maximal operator
[TABLE]
is simple. An important hypothesis for this is that is negative and bounded from above away from zero on the complement of some compact set (see Section 3 (iii)). This is always satisfied if is an inf-compact function (i.e., the sublevel sets are compact, or empty, in for each ), or if is a positive function vanishing at infinity and the process is recurrent under some stationary Markov control. Let the positive function , normalized as to render it unique, denote the principal eigenvector, that is, , and define . The function
[TABLE]
plays a very important role in the analysis, and can be interpreted as an infinitesimal relative entropy rate (see Section 4). To keep the notation simple, we define , and use the single variable . Let denote the set of probability measures on the Borel -algebra of , and denote the set of infinitesimal ergodic occupation measures for the operator defined by
[TABLE]
where is the class of functions in which have compact support. We also define
[TABLE]
Then, under the mild hypotheses of Section 3, we show in Proposition 4.1 that
[TABLE]
We next specialize the results to the case where the diffusion matrix is bounded and uniformly elliptic (see Section 4), and show in Theorem 4.11 that under any of the hypotheses of Section 4 we have . This permits us to replace with and with in the second and third equalities of Eq. 7, respectively. We note here that if is bounded and uniformly elliptic, then Section 4 is satisfied when either is inf-compact, or has subquadratic growth, or is bounded.
We also show that if is bounded (see Lemma 4.13 for explicit conditions on the parameters under which this holds), then we can commute the ‘’ and the ‘’ to obtain
[TABLE]
Also, in Theorem 4.15, we establish the variational formula over the class of functions in whose partial derivatives up to second order have at most polynomial growth in .
1.2 Notation
The standard Euclidean norm in is denoted by , and stands for the set of natural numbers. The closure, the boundary and the complement of a set are denoted by , and , respectively. We denote by the first exit time of the process from the set , defined by
[TABLE]
The open ball of radius in , centered at , is denoted by , and is the ball centered at [math]. We let , and . For a Borel space , denotes the set of probability measures on its Borel -algebra.
The term domain in refers to a nonempty, connected open subset of the Euclidean space . For a domain , the space () refers to the class of all real-valued functions on whose partial derivatives up to order exist and are continuous (and bounded). In addition denotes the class of functions in that have compact support. The space , , stands for the Banach space of (equivalence classes of) measurable functions satisfying , and is the Banach space of functions that are essentially bounded in . The standard Sobolev space of functions on whose generalized derivatives up to order are in , equipped with its natural norm, is denoted by , , .
In general, if is a space of real-valued functions on , consists of all functions such that for every , the space of smooth functions on with compact support. In this manner we obtain for example the space .
We adopt the notation , and for , and , and use the standard summation rule that repeated subscripts and superscripts are summed from through .
2 The problem on a bounded domain
In this section, we consider the risk-sensitive reward maximization with state dynamics given by a reflected diffusion on a bounded domain with co-normal direction of reflection. In particular, the dynamics are given by
[TABLE]
where denotes the local time of the process on the boundary . The random processes in Eq. 8 live in a complete probability space . The process is a -dimensional standard Wiener process independent of the initial condition . The control process takes values in a compact, metrizable set , and is jointly measurable in . The set of admissible controls consists of the control processes that are non-anticipative: for , is independent of
[TABLE]
Concerning the coefficients of the equation, we assume the following:
- (i)
The drift is a continuous map from to , and Lipschitz in its first argument uniformly with respect to the second. 2. (ii)
The diffusion matrix is continuously differentiable, its derivatives are Hölder continuous, and is non-degenerate in the sense that the minimum eigenvalue of a(x)=\bigl{[}a^{ij}(x)\bigr{]}\coloneqq\upsigma(x)\upsigma^{\mathsf{T}}(x) on is bounded away from zero. 3. (iii)
The reflection direction is co-normal, that is, is given by
[TABLE]
where is the unit outward normal.
We let denote the set of stationary Markov controls, that is, the set of Borel measurable functions . Given , the stochastic differential equation in Eq. 8 has a unique strong solution. The same is true for the class of Markov controls [8, Chapter 2]. Let and denote the probability measure and expectation operator on the canonical space of the process controlled under , with initial condition .
Given a continuous reward function , which is Lipschitz continuous in its first argument uniformly with respect to the second, the objective of the risk-sensitive reward problem is to maximize
[TABLE]
over all admissible controls . We define
[TABLE]
The solution of this problem shows that does not depend on .
We let
[TABLE]
and denote its subspace consisting of nonnegative functions.
For , and , we define
[TABLE]
We summarize some results from [9] that are needed in Theorem 2.1 below. Without loss of generality we assume that .
Consider the operator , , defined by
[TABLE]
The characterization of is exactly analogous to [9, Theorem 3.2], which considers the minimization problem (see also [9, Remark 4.2]). Specifically, for each , and , the quasi-linear parabolic p.d.e. in , with for all , and for all , has a unique solution in {\mathcal{C}}^{1+\nicefrac{{\delta}}{{2}},2+\delta}\bigl{(}[0,T]\times\overline{Q}\bigr{)}. This solution has the stochastic representation for all .
Following the analysis in [9] we obtain the following characterization of defined in Eq. 11.
Theorem 2.1**.**
There exists a unique pair which solves
[TABLE]
Also, , for . In addition, we have
[TABLE]
and
[TABLE]
Proof 2.2**.**
Equation 14* is the result in [9, Lemma 2.1], while the other assertions follow from Lemma 4.5 and Remark 4.2 in [9]. *
2.1 A variational formula
Define
[TABLE]
and an operator by
[TABLE]
It is important to note that if is a positive function and , then
[TABLE]
Thus, we obtain from Eq. 14 that
[TABLE]
We let
[TABLE]
for and .
It is clear that Eq. 15 can be written as
[TABLE]
Let denote the class of infinitesimal ergodic occupation measures for the operator , defined by
[TABLE]
Implicit in this definition is the requirement that for all and . We have the following result.
Theorem 2.3**.**
It holds that
[TABLE]
Moreover, may be replaced with in Eq. 18, and thus
[TABLE]
Proof 2.4**.**
The first equality in Eq. 18 follows by Eq. 17. We continue to prove the rest of the assertions. First note that
[TABLE]
because the infimum on the left hand side is for . It follows by Eq. 17 that . Let be a measurable selector from the maximizer of Eq. 13, that is,
[TABLE]
With , Eq. 13 takes the form
[TABLE]
The reflected diffusion with drift b\bigl{(}x,v_{*}(x)\bigr{)}+a(x)\nabla\phi(x) is of course exponentially ergodic. Let denote its invariant probability measure. Then, Eq. 19 implies that
[TABLE]
Let be defined by
[TABLE]
where denotes the Dirac mass at . Then is an ergodic occupation measure for the controlled reflected diffusion with drift , and thus . Let be arbitrary. Then
[TABLE]
*where the second equality follows by Eq. 20. Thus , and since we have already asserted the reverse inequality, we must have equality. This establishes Eq. 18, and also proves the last assertion of the theorem. *
3 The risk-sensitive reward problem on
In this section we study the risk-sensitive reward maximization problem on . We consider a controlled diffusion of the form
[TABLE]
All random processes in Eq. 21 live in a complete probability space . The control process lives in a compact metrizable space .
We approach the problem in as a limit of Dirichlet or Neumann eigenvalue problems on balls , . Differentiability of the matrix can be relaxed here. Consider the eigenvalue problem on a ball , with Neumann boundary conditions, and the reflection direction along the exterior normal to at . The drift is continuous, and Lipschitz in its first argument uniformly with respect to the second. The diffusion matrix is Lipschitz continuous on and non-degenerate. Let denote the principal eigenvalue on under Neumann boundary conditions of the operator defined in Eq. 12. We refer to as the Neumann eigenvalue on . It follows from the results in [30] (see in particular Theorems 5.1, 6.6, and Proposition 7.1) that there exists a unique , with on and , solving
[TABLE]
and on . We also refer the reader to [24, Theorem 12.1, p. 195].
We adopt the following structural hypotheses on the coefficients of Eq. 21 and the reward function have the following structural properties.
{assumption}
- (i)
The drift is continuous, and for some constant depending on , we have
[TABLE]
where \lVert\upsigma\rVert\coloneqq\bigl{(}\operatorname*{trace}\,\upsigma\upsigma^{\mathsf{T}}\bigr{)}^{\nicefrac{{1}}{{2}}} denotes the Hilbert–Schmidt norm of . 2. (ii)
The reward function is continuous and locally Lipschitz in its first argument uniformly with respect to , is bounded from above in , and has polynomial growth in . 3. (iii)
We assume that the Neumann eigenvalues satisfy
[TABLE]
Section 3 is enforced throughout the rest of the paper, unless mentioned otherwise. Part (i) of this assumption are the usual hypotheses that guarantee existence and uniqueness of strong solutions to Eq. 21 under any admissible control.
Remark 3.1**.**
Equation 24* is a version of the near-monotone assumption, which is often used in ergodic control problems (see [8]). This has the effect of penalizing instability, ensuring tightness of laws for optimal controls. There are two important cases where Eq. 24 is always satisfied. First, when is inf-compact. In this case we have and , since the Dirichlet eigenvalues which are a lower bound for are increasing as a function of the domain [7, Lemma 2.1]. Second, when is positive and vanishes at infinity, and under some stationary Markov control the process in Eq. 21 is recurrent. This can be established by comparing with the Dirichlet eigenvalue on (see Section 3.2), and using [7, Theorems 2.6 and 2.7 (ii)]. For related studies concerning the class of running reward functions vanishing at infinity, albeit in the uncontrolled case, see [22, 23, 7, 10]. See also [4, Theorem 2.12] which studies the Collatz–Wielandt formula for the risk-sensitive minimization problem. *
Recall that denotes the set of stationary Markov controls. For , we use the simplifying notation
[TABLE]
and define analogously.
We next review some properties of eigenvalues of linear and semilinear operators on . For and , define
[TABLE]
with as in Eq. 12. Let . Suppose that a positive function and solve the equation
[TABLE]
We refer to any such solution as an eigenpair of the operator , and we say that is an eigenvector with eigenvalue . Note that by eigenvector we always mean a positive function. Let . We refer to the Itô stochastic differential equation
[TABLE]
as the twisted SDE, and to its solution as the twisted process corresponding to . Clearly is the extended generator of Eq. 27.
We define the generalized principal eigenvalue of the operator by
[TABLE]
A principal eigenvector is a positive solution of Eq. 26 with . A principal eigenvector is also called a ground state, and we refer to the corresponding twisted SDE and twisted process as a ground state SDE and ground state process respectively. Unlike what is common in criticality theory, our definition of a ground state does not require the minimal growth property of the principal eigenfunction (see [6]).
An easy calculation shows that any eigenpair of satisfies
[TABLE]
with . In other words, is an eigenpair of . Note also that is a solution to the ‘linear’ eigenvalue equation
[TABLE]
and that this equation can also be written as
[TABLE]
An extensive study of generalized principal eigenvalues with applications to risk-sensitive control can be found in [3, 7]. In these papers, the ‘potential’ is assumed to be bounded below in , so the results cannot be quoted directly. It is not our intention to reproduce all these results for potentials which are bounded above, so we only focus on results that are needed later in this paper. We only quote results in [3, 7] which do not depend on the assumption that is bounded below. Generally speaking, caution should be exercised with arguments in [3, 7] that employ the Fatou lemma. On the other hand, since usually appears in the exponent, invoking Fatou’s lemma hardly ever poses any problems.
Suppose that the twisted process in Eq. 27 is regular, that is, the solution exists for all times. Then, an application of [7, Lemma 2.3] shows that an eigenvector has the stochastic representation (semigroup property)
[TABLE]
Recall that denotes the first hitting time of the ball , for . We need the following lemma.
Lemma 3.2**.**
We assume only Section 3 (i)–(ii). The following hold.
- (a)
If is an eigenpair of under some , and the twisted process in Eq. 27 is exponentially ergodic, then we have the stochastic representation
[TABLE]
In addition, , the generalized principal eigenvalue of , and the ground state is unique up to multiplication by a positive constant. 2. (b)
Any eigenpair of satisfying Eq. 32 is a principal eigenpair, and is a simple eigenvalue.
Proof 3.3**.**
Combining the proof of [7, Theorem 2.2] with [7, Theorem 3.1], we deduce that for every , there exists a such that
[TABLE]
Applying the Itô formula to Eq. 26 we obtain
[TABLE]
We study separately the three integrals on the right-hand side of Eq. 34, which we denote as , . For the first integral we have
[TABLE]
by monotone convergence. Note that the limit is also finite by Eq. 33.
Let and denote the probability measure and expectation operator on the canonical space of the twisted process in Eq. 27 with initial condition . Next, using again the technique in [7, Theorem 2.2], we write
[TABLE]
where in the second inequality we apply [7, Lemma 2.3]. Thus, vanishes as .
Concerning , using monotone convergence, we obtain
[TABLE]
where the inequality follows from the proof of [7, Lemma 2.3]. In turn, the right-hand side of Eq. 35 vanishes as , since the twisted process is geometrically ergodic. This completes the proof of Eq. 32.
Suppose that a positive and solve
[TABLE]
An application of Itô’s formula and Fatou’s lemma then shows that
[TABLE]
Equations 32* and 36 imply that if we scale by multiplying it with a positive constant until it touches at one point from above, the function attains its minimum value of at some point in . A standard calculation shows that*
[TABLE]
Thus, must equal a constant by the strong maximum principle, which implies that . This of course means that . Uniqueness of is evident from the preceding argument. This completes the proof of part (a).
*Part (b) is evident from the preceding paragraph. This completes the proof. *
3.1 The Bellman equation in
Recall the solution of (22), the definition of in Eq. 24, and the definition of in Eq. 3. We define
[TABLE]
Recall the definitions of and in Eqs. 1 and 2. Note that if is an eigenpair of , then similarly to Eq. 31, we have
[TABLE]
with .
Theorem 3.4**.**
There exists satisfying
[TABLE]
and the following hold:
- (a)
The function is inf-compact. 2. (b)
If is an a.e. measurable selector from the maximizer of Eq. 39, then, the diffusion with extended generator , as defined in Eq. 25, is exponentially ergodic and satisfies
[TABLE]
with . 3. (c)
. 4. (d)
* and as uniformly on compact sets, and the solution to Eq. 39 is unique up to a scalar multiple, and satisfies*
[TABLE]
for all , and for all , with equality if and only if is an a.e. measurable selector from the maximizer in Eq. 39.
Proof 3.5**.**
Using Theorem 2.1 and (10)-(11), it follows that , and this combined with Section 3 (iii) shows that converges along some subsequence to . Therefore, the convergence of along some further subsequence to a satisfying Eq. 39 follows as in the proof of [13, Lemma 2.1].
We now turn to part (a). Here in fact we show that has at least logarithmic growth in . Let be a constant such that for all outside some compact set in . Consider a function of the form \phi(x)=\bigl{(}1+\lvert x\rvert^{2}\bigr{)}^{-\theta}, with . By Item (i), there exists and such that
[TABLE]
We fix such a constant . We restrict our attention to solutions of Eq. 22 over an increasing sequence in , also denoted as , such that converges to . It is clear then that we may enlarge the radius , if needed, so that
[TABLE]
Next, let be a convex function in such that for , and is constant and positive for . This can be chosen so that and . Such a function can be constructed by requiring, for example, that for , from which we obtain for . A simple calculation shows that . Note that for all by convexity. Let \breve{\chi}_{\epsilon}(t)\coloneqq\epsilon\breve{\chi}\bigl{(}\nicefrac{{t}}{{\epsilon}}\bigr{)} for . Then
[TABLE]
Using Eqs. 42, 43, and 44, we obtain
[TABLE]
For the last inequality in Eq. 45, we use the properties and from Eq. 44, that the fact that and . Note that, due to radial symmetry, the support of \breve{\chi}^{\prime}_{\epsilon}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}\phi is a ball of the form , with an nonincreasing continuous function with as . Recall the functions in Eq. 22. Select such that . Scale until it touches \breve{\chi}_{\epsilon}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}\phi at some point from below. Here, \breve{\chi}_{\epsilon}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}\phi denotes the composition of and . Let be a measurable selector from the minimizer in Eq. 22, and define h_{n}\coloneqq\breve{\chi}_{\epsilon}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}\phi-V_{n}. Then, by Eqs. 22 and 45, we have
[TABLE]
and on , since the gradient of \breve{\chi}_{\epsilon}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}\phi vanishes on . It follows by the strong maximum principle that cannot lie in the . Thus on this set. This implies that cannot lie on either, without contradicting the Hopf boundary point lemma. Thus . This however shows by taking limits as , and employing the Harnack inequality which asserts that for all for some constant , that for some constant . This proves part (a).
Equation 40* follows by Eq. 29. Since is inf-compact and the right hand side of Eq. 40 is negative and bounded away from zero outside a compact set by Section 3 (iii), the associated diffusion is ergodic [22, Theorem 4.1]. In turn, the Foster–Lyapunov equation in Eq. 40 shows that the diffusion is exponentially ergodic [28]. This proves part (b).*
Moving to the proof of part (c), suppose that for some we have
[TABLE]
Evaluating this equation at measurable selector from the maximizer of Eq. 39, and following the argument in the proof of Lemma 3.2 we obtain and . This also shows that by the definition in Eq. 37, and thus we have equality by Eq. 39.
*In order to prove part (d), suppose that along some subsequence. Taking limits along perhaps a further subsequence, we obtain a positive function that satisfies Eq. 46 with equality. Thus and and by part (c). The stochastic representation in Eq. 41 follows as in the proof of Lemma 3.2. This completes the proof. *
3.2 Dirichlet eigenvalues and the risk-sensitive value
In this section we first show that the problem in can also be approached by using Dirichlet eigensolutions. The main result is Theorem 3.8, which establishes that equals the risk-sensitive value , and the usual verification of optimality criterion.
We borrow some results from [11, 12]. These can also be found in [3, Lemma 2.2], and are summarized as follows: Fix any . For each there exists a unique pair (\Psi_{\mspace{-2.0mu}v,r},\lambda_{v,r})\in\bigl{(}{\mathscr{W}}^{2,p}(B_{r})\cap{\mathcal{C}}(\bar{B}_{r})\bigr{)}\times\mathds{R}, for any , satisfying on , on , and , which solves
[TABLE]
Moreover, the solution has the following properties:
- (i)
The map is continuous and strictly increasing. 2. (ii)
In its dependence on the function , is nondecreasing, convex, and Lipschitz continuous (with respect to the norm) with Lipschitz constant . In addition, if then .
We refer to and as the (Dirichlet) eigenvalue and eigenfunction, respectively, of the operator on .
Recall the definition of in Eq. 3. Based on the results in [31], there exists a unique pair (\Psi_{\mspace{-2.0mu}*,r},\lambda_{*,r})\in\bigl{(}{\mathcal{C}}^{2}(B_{r})\cap{\mathcal{C}}(\bar{B}_{r})\bigr{)}\times\mathds{R}, satisfying on , on , and , which solves
[TABLE]
and properties (i)–(ii) above hold for . Also recall the definitions of the generalized principal eigenvalues in Eqs. 28 and 37, and defined in Eq. 22.
Lemma 3.6**.**
The following hold:
- (i)
For , we have for all , and . 2. (ii)
* for all , and .*
Proof 3.7**.**
Part (i) is a straightforward application of the strong maximum principle. By Eqs. 12 and 48 we have
[TABLE]
Let , and suppose that . Scale so that it touches at one point from below in . Then is nonnegative, and by Eqs. 47 and 49 it satisfies
[TABLE]
This however implies that on which is a contradiction. Hence for all and the inequality follows by the continuity of . Following the same method, with , we obtain .
*Part (ii) follows by [7, Lemma 2.2 (ii)]. *
Recall the definitions in Eqs. 10 and 11, and let
[TABLE]
and similarly for and . Also, recall that
[TABLE]
The theorem that follows concerns the equality . Recall the definition in Eq. 24.
Theorem 3.8**.**
*We have . In addition, if and only if is an a.e. measurable selector from the maximizer of Eq. 39. *
Proof 3.9**.**
We already have from Theorem 3.4. This also gives
[TABLE]
Choose such that . This is possible by Eq. 24. Let be given, and select a smooth, non-negative cut-off function that vanishes in and equals to in . Let , and select small enough so that
[TABLE]
This is clearly possible since is positive and
[TABLE]
We have
[TABLE]
Since is bounded below away from zero, a standard use of Itô’s formula and the Fatou lemma applied to Eq. 50 shows that for all . Since is arbitrary this implies , and hence we must have equality. This also shows that every a.e. measurable selector from the maximizer of Eq. 39 is optimal.
Next, for , let be an eigenpair, obtained as a limit of Dirichlet eigenpairs \bigl{\{}(\lambda_{v,n},\Psi_{\mspace{-2.0mu}v,n})\bigr{\}}_{n\in\mathds{N}}, with , along some subsequence (see Lemma 3.6). Let be defined by
[TABLE]
First suppose that . Then, using the the argument in the preceding paragraph, together with the fact that , we deduce that for all . Thus if is optimal, we must have . This implies that we can select a ball such that
[TABLE]
for all sufficiently large . Let . By [3, Lemma 2.10 (i)], we have the stochastic representation
[TABLE]
Next we show that that vanishes at infinity by using the argument in the proof of Theorem 3.4. The analysis is simpler here. Selecting the same function as in the proof of Theorem 3.4, there exists such that
[TABLE]
Since , employing the Harnack inequality we scale so that on for all . The strong maximum principle then shows that on .
Thus is inf-compact, which together with the Lyapunov equation \widetilde{\mathcal{L}}^{\psi_{v}}_{v}\Psi_{\mspace{-2.0mu}v}^{-1}=\bigl{(}c_{v}-{\rho_{*}})\Psi_{\mspace{-2.0mu}v}^{-1} imply that the ground state process is exponentially ergodic. By Lemma 3.2, we then have
[TABLE]
On the other hand, it holds that , which implies that
[TABLE]
Comparing the functions in Eqs. 51 and 52 using the strong maximum principle, as done in the proof of Lemma 3.2, we deduce that . Thus is a measurable selector from the maximizer of Eq. 39.
It remains to address the case . By [6, Corollary 3.2] there exists a positive constant such that , and . Thus repeating the above argument we obtain
[TABLE]
*Therefore, cannot be optimal. This completes the proof. *
4 The variational formula on
In this section we establish the variational formula on . As mentioned in Section 1.1, the function in Eq. 4 plays a very important role in the analysis. To explain how this function arises, let denote the probability measure on the canonical path space of the diffusion Eq. 21 under a control , and the analogous probability measure corresponding to the diffusion
[TABLE]
with as in Theorem 3.4. By the Cameron–Martin–Girsanov theorem we obtain
[TABLE]
Thus, the relative entropy, or Kullback–Leibner divergence between and takes the form
[TABLE]
Dividing this by , and letting , we see that is the infinitesimal relative entropy rate.
Recall from Section 1.1 the definition , and the use of the single variable in the interest of notational simplicity. Also recall the definitions in Eqs. 5 and 6. Recall the definitions in Eqs. 1 and 2. In analogy to Eq. 16, we define
[TABLE]
The following result plays a central role in this paper.
Proposition 4.1**.**
We have
[TABLE]
*In addition, if , then may be replaced by in Eq. 53. *
In the proof of Proposition 4.1 and elsewhere in the paper we use a cut-off function defined as follows (compare this with the function in the proof of Theorem 3.4).
Definition 4.2**.**
*Let be a smooth convex function such that for , and for . Then and are nonnegative and the latter is supported on . It is clear that we can choose so that . We scale this function by defining for . Thus for , and for . Observe that if is an inf-compact function then is compactly supported by the definition of . *
Proof 4.3** (Proof of Proposition 4.1).**
We start with the first equality in Eq. 53. By Eq. 30, we have
[TABLE]
As shown in Theorem 3.4 the twisted process with extended generator is exponentially ergodic. Let denote its invariant probability measure. Since vanishes at infinity, and is a Lyapunov function by Eq. 40, it then follows from Eq. 54, by using the Itô formula and applying [8, Lemma 3.7.2 (ii)], that
[TABLE]
Next, we show that
[TABLE]
We write Eq. 39 as
[TABLE]
and using the identity
[TABLE]
to obtain (compare with Eq. 38)
[TABLE]
Using the function in Definition 4.2, the identity
[TABLE]
and the definition of , we obtain from Eq. 57 that
[TABLE]
Let , and without loss of generality assume that . The integral of the first term in Eq. 58 with respect to vanishes by the definition of . Thus, we have
[TABLE]
with . Since , then taking limits as in Eq. 59, using dominated convergence together with the fact that as , we see that the right-hand side of Eq. 59 goes to [math]. Also, using Fatou’s lemma and the fact that as , we obtain from Eq. 59 that
[TABLE]
This proves Eq. 56. Now, if we let
[TABLE]
then
[TABLE]
which implies that . Then, the second equality in Eq. 55 can be written as
[TABLE]
while the first equality in Eq. 55 together with the fact that is bounded above and is finite implies that . Therefore, , and the first equality in Eq. 53 now follows from Eqs. 56 and 61.
We now turn to the proof of the second equality in Eq. 53. Note that it then . On the other hand, if then, as also stated in the proof of Theorem 2.3, . The remaining case is , for which we have , thus proving the equality.
*The second statement of the proposition follows directly from the arguments used above. *
Remark 4.4**.**
One can follow the argument in the proof of [5, Theorem 1.4], using Radon–Nikodym derivatives instead of densities, to show that every maximizing infinitesimal ergodic occupation measure for Eq. 53 has the form
[TABLE]
where denotes the Dirac mass at , and is an optimal ergodic occupation measure of the diffusion associated with operator defined by
[TABLE]
*for and . We leave the verification of this assertion to the reader. *
We continue our analysis by investigating conditions on the model parameters which imply that . We impose the following hypothesis on the matrix .
{assumption}
The matrix is bounded and has a uniform modulus of continuity on , and is uniformly non-degenerate in the sense that the minimum eigenvalue of is bounded away from zero on .
We start with the following lemma, which can be viewed as a generalization of [3, Lemma 3.3]. Section 3, which applies by default throughout the paper, need not be enforced in this lemma.
Lemma 4.5**.**
Consider a linear operator in , of the form
[TABLE]
and suppose that the matrix satisfies Section 4, and the coefficients and are locally bounded and measurable. Then, there exists a constant such that any strong positive solution , , to the equation
[TABLE]
satisfies
[TABLE]
Proof 4.6**.**
We use scaling. For any fixed , with , we define
[TABLE]
and the scaled function
[TABLE]
and similarly for the functions , , and . The equation in Eq. 62 then takes the form
[TABLE]
It is clear from the hypotheses that the coefficients of Eq. 63 are bounded in the ball , with a bound independent of , and that the modulus of continuity and ellipticity constants of the matrix in are independent of . We follow the argument in [3, Lemma 3.3], which is repeated here for completeness. First, by the Harnack inequality [21, Theorem 9.1], there exists a positive constant independent of the point chosen, such that for all . Let
[TABLE]
By a well known a priori estimate [16, Lemma 5.3], there exists a constant , again independent of , such that,
[TABLE]
where in the last inequality, we used the Harnack property. Clearly then, the resulting constant does not depend on . Next, invoking Sobolev’s theorem, which asserts the compactness of the embedding {\mathscr{W}}^{2,p}\bigl{(}B_{1}(x_{0})\bigr{)}\hookrightarrow{\mathcal{C}}^{1,r}\bigl{(}B_{1}(x_{0})\bigr{)}, for and (see [16, Proposition 1.6]), and combining this with Eq. 64, we obtain
[TABLE]
for some constant independent of . Thus
[TABLE]
Using Eq. 65 and the identity for all , we obtain
[TABLE]
*Of course is arbitrary. The same is true with any radius, with perhaps a different constant. This completes the proof. *
Remark 4.7**.**
Lemma 4.5* should be compared with similar gradient estimates in the literature. Its benefit is that it matches or exceeds the estimates in [26, Lemma 5.1] and [15, Theorem A.2], without requiring any regularity on the coefficients. *
{assumption}
One of the following holds:
- (a)
The function is inf-compact. 2. (b)
The drift satisfies
[TABLE] 3. (c)
There exists a constant such that (compare this with [4, Theorem 3.1 (b)])
[TABLE]
where , and is as in Theorem 3.4.
Remark 4.8**.**
Section 4* (c) is not specified in terms of the parameters of the equation. However, Section 4 together with the hypothesis that is bounded implies Section 4 (c). This is asserted by Lemma 4.5. See also Lemma 4.13 later in this section. *
We have the following estimate concerning the growth of the function in Theorem 3.4. This does not require the uniform ellipticity hypothesis in Section 4.
Lemma 4.9**.**
Grant Section 4 part (a) or (b). Then there exists a function , with , such that the solution in Eq. 39 satisfies
[TABLE]
Proof 4.10**.**
We start with part (a). Let be a strictly increasing function, satisfying and as , and
[TABLE]
This is always possible. A specific function satisfying these properties is given by
[TABLE]
Let be a constant such that \bigl{\lvert}{\mathcal{L}}_{v_{*}}(\log\lvert x\rvert)\bigr{\rvert}\leq c_{1} for all . Such a constant exists since and have at most linear growth in by Item (i). We define
[TABLE]
Since the functions and are inf-compact, it is clear that as .
Define the family of functions
[TABLE]
Note that for any we have
[TABLE]
Thus, applying Eq. 70 and the bound \bigl{\lvert}{\mathcal{L}}_{v_{*}}(\log\lvert x\rvert)\bigr{\rvert}\leq c_{1}, we obtain
[TABLE]
Combining Eqs. 54 and 71, and completing the squares, we have
[TABLE]
Recall that , and . Choose large enough so that on . It then follows by the definitions in Eqs. 68 and 69 that {\varphi_{\mspace{-2.0mu}*}}-\chi_{t}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}h_{r}<0 on for all . Also, for each , the difference {\varphi_{\mspace{-2.0mu}*}}-\chi_{t}\mathbin{\mathchoice{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}{\vbox{\hbox{\scriptscriptstyle\circ}}}}h_{r} is negative outside some compact set by the inf-compactness of . Note also that on . Hence Items (i) and 69 imply that there exists such the right-hand side of Eq. 72 is negative on for all and all . An application of the strong maximum principle then shows that on for all .
Now, note that
[TABLE]
Since is strictly increasing, the inequality Eq. 67 holds with
[TABLE]
This completes the proof under Section 4 (a) .
The proof under part (b) of the assumption is similar. The only difference is that here we use the fact that m_{r}\,\coloneqq\,\sup_{x\in B_{r}^{c}}\,\bigl{(}{\mathcal{L}}_{v_{*}}(\log\lvert x\rvert)\bigr{)}^{-}\to 0 as , which is implied by Eq. 66. Thus with any constant such that outside some compact set, we choose as
[TABLE]
*The rest is completely analogous to the analysis above. This concludes the proof. *
The first part of the theorem which follows is quite technical, but identifies a rather deep property of the ergodic occupation measures of the operator . It shows that under Sections 4 and 4 (a) or (b), or Section 4 (c), if such a measure is feasible for the maximization problem, or in other words, it satisfies , then it necessarily has “finite average” entropy, that is , or equivalently, it belongs in the class . The proof uses the method of contradiction. We first show that if such a measure is not in the class , then the left hand side of Eq. 59 grows at a geometric rate as a function of . Then we obtain a contradiction by evaluating the right-hand side of Eq. 59 using this geometric growth together with the bound in Lemma 4.9.
Theorem 4.11**.**
- (i)
Under Sections 4 and 4 (a) or (b), or Section 4 (c), we have . This of course implies by Proposition 4.1 that
[TABLE] 2. (ii)
Let Section 4 hold, and suppose that
[TABLE]
Then
[TABLE]
Proof 4.12**.**
We first prove part (i) under under Section 4 (a) or (b). We argue by contradiction. Let , and suppose that . As in the proof of Proposition 4.1 we let . Let and denote the left and the right-hand side of Eq. 59, respectively, and define
[TABLE]
Then of course as by the hypothesis. Expanding we see that
[TABLE]
Since is finite, it follows that and are also finite. Moreover, the second assertion and the fact that is bounded above imply that . Thus, using the Cauchy–Schwarz inequality in the above display and the fact is bounded, we have
[TABLE]
for some constants and which are bounded in .
First suppose that over some sequence we have as . This implies by Eq. 75 that . However, if this is the case, then the inequality
[TABLE]
which is implied by Eqs. 59 and 75, contradicts the fact that as . Thus we must have , and same applies to the fraction .
Define
[TABLE]
We have for , by definition of these quantities. Recall that is defined as the right-hand side of Eq. 59. Note then that, since , we have for some . Therefore, since , there exists such that
[TABLE]
Thus , which implies that . This of course means that diverges at a geometric rate in , that is, . Let denote the inverse of the map . Note that for some positive constants and by Lemma 4.5 and the hypothesis that has polynomial growth in Section 3 (ii). Thus, by Lemma 4.9, we obtain
[TABLE]
for all . However, this implies from Eq. 76 that
[TABLE]
for some constant , and we reach a contradiction. Therefore, .
Moving on to the proof under Section 4 (c), we replace the function in Definition 4.2 by a function defined as follows. For , we let be a convex function such that for , and for . Then and are nonnegative. In addition, we select so that for and . This is always possible. We follow the same analysis as in the proof of Proposition 4.1, with the function as chosen, and obtain
[TABLE]
where . The integral on the right-hand side of Eq. 77 vanishes as by the hypothesis that , so again we obtain Eq. 60 which implies the result. This completes the proof of part (i).
We continue with part (ii). We use a convex function , for , satisfying for , for , and for , for some . We let h_{t}(x)=\hat{\chi}_{t}\bigl{(}{\varphi_{\mspace{-2.0mu}*}}(x)\bigr{)}. We may translate so that it is smaller than on . By (58), we have
[TABLE]
We claim that given any there exists such that for all . This of course suffices to establish Eq. 74.
By Section 3 (iii) there exists such that the first term on the right-hand side of Eq. 78 is nonpositive for all . Also, using the definition of , we have
[TABLE]
*by the hypothesis, and since is inf-compact by Theorem 3.4. This proves the claim, and completes the proof. *
There is a large class of problems which satisfy Eq. 73. It consists of equations with having at most linear growth in and growing no faster than . This fact is stated in the following lemma.
Lemma 4.13**.**
Grant Section 4 and suppose that
[TABLE]
*Then Eq. 73 holds. *
Proof 4.14**.**
We use the function in Definition 4.2. Let be such that on . Note that there exists a constant such that
[TABLE]
Thus for some small enough, using Eq. 54, we obtain
[TABLE]
An application of the strong maximum principle then shows that . Therefore, using Lemma 4.5, we obtain
[TABLE]
*for some constant . *
We next present the variational formula over functions in whose derivatives up to second order have at most polynomial growth in . Let denote this space of functions.
Theorem 4.15**.**
Under Section 3 alone, we have
[TABLE]
Under Sections 4 and 4 (a) or (b), we have
[TABLE]
Proof 4.16**.**
[TABLE]
Since , this implies that
[TABLE]
On the other hand, by Theorem 3.4 (d), it follows that for any we have
[TABLE]
which then implies the converse inequality
[TABLE]
This proves Eq. 79.
Concerning Eq. 80, the first equality follows as in the preceding paragraph since by Assumptions 3 (i)–(ii) and 4, and Lemma 4.5. Turning now our attention to the second equality in Eq. 80, recall from the proof of Proposition 4.1 that denotes the invariant probability measure of . Under Section 4 (a) or (b), Lemma 4.9 shows that grows faster in than any polynomial. Therefore, for all by Eq. 40. Since has at most polynomial growth, and has at most linear growth, we obtain
[TABLE]
Continuing, if Eq. 81 holds, then it is standard to show by employing a cut-off function, that
[TABLE]
Let denote the ergodic occupation measure corresponding to , that is,
[TABLE]
Equation 82* implies that*
[TABLE]
Since
[TABLE]
*the second equality in Eq. 80 then follows by Eqs. 79 and 83. *
5 The risk-sensitive cost minimization problem
Using Lemma 4.5, we can improve the main result in [3] which assumes bounded drift and running cost.
We say that a function defined on a locally compact space is coercive, or near-monotone, relative to a constant if there exists a compact set such that . Recall that an admissible control for Eq. 21 is a process which takes values in , is jointly measurable in , and is non-anticipative, that is, for , is independent of given in Eq. 9. We let denote the class of admissible controls, and the expectation operator on the canonical space of the process under the control , conditioned on the process starting from at .
Let be continuous, and Lipschitz continuous in its first argument uniformly with respect to the second. We define the risk-sensitive penalty by
[TABLE]
and the risk-sensitive optimal values by , and . Let
[TABLE]
and
[TABLE]
We say that is strictly monotone at on the right if for all non-trivial nonnegative functions with compact support.
Proposition 5.1 below improves [3, Proposition 1.1]. We first state the assumptions.
{assumption}
In addition to Section 4 we require the following.
- (i)
The drift and running cost satisfy, for some and a constant , the bound
[TABLE]
for all . 2. (ii)
The drift satisfies
[TABLE]
Proposition 5.1**.**
Grant Section 5, and suppose that is coercive relative to . Then the HJB equation
[TABLE]
has a solution , satisfying , and the following hold:
- (a)
* for all .* 2. (b)
Any that satisfies
[TABLE]
a.e. , is stable, and is optimal, that is, for all . 3. (c)
It holds that
[TABLE]
for any that satisfies Eq. 86. 4. (d)
If is strictly monotone at on the right, then there exists a unique positive solution to Eq. 85, up to a multiplicative constant, and any optimal satisfies Eq. 86.
Proof 5.2**.**
A modification of [3, Lemma 3.2] (e.g., applying Itô’s formula to the function ) shows that Eq. 84 implies that
[TABLE]
*From this point on, the proof follows as in [3], using Lemma 4.5. Indeed, parts (a) and (b) follow from [3, Theorem 3.4] by using the above estimate and Lemma 4.5. Since , any minimizing selector is recurrent. Moreover, the twisted diffusion corresponding to the minimizing selector is regular. Thus part (c) follows from [3, Theorem 1.5]. In addition, the hypothesis in (d) implies that for any minimizing selector , is right monotone at which, in turn, implies the simplicity of the principal eigenvalue by [3, Theorem 1.2]. This also implies the last claim by [3, Lemma 3.6]. *
Acknowledgements
The work of Ari Arapostathis was supported in part by the National Science Foundation through grant DMS-1715210, in part the Army Research Office through grant W911NF-17-1-001, and in part by the Office of Naval Research through grant N00014-16-1-2956 which was approved for public release under DCN #43-5025-19. The research of Anup Biswas was supported in part by an INSPIRE faculty fellowship and DST-SERB grant EMR/2016/004810, while the work of Vivek Borkar was supported by a J. C. Bose Fellowship.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Akian, S. Gaubert, and R. Nussbaum , A Collatz-Wielandt characterization of the spectral radius of order-preserving homogeneous maps on cones , ar Xiv e-prints, 1112.5968 (2011), https://arxiv.org/abs/1112.5968 .
- 2[2] V. Anantharam and V. S. Borkar , A variational formula for risk-sensitive reward , SIAM J. Control Optim., 55 (2017), pp. 961–988, https://doi.org/10.1137/151002630 . · doi ↗
- 3[3] A. Arapostathis and A. Biswas , Infinite horizon risk-sensitive control of diffusions without any blanket stability assumptions , Stochastic Process. Appl., 128 (2018), pp. 1485–1524, https://doi.org/10.1016/j.spa.2017.08.001 . · doi ↗
- 4[4] A. Arapostathis and A. Biswas , A variational formula for risk-sensitive control of diffusions in ℝ d superscript ℝ 𝑑 \mathbb{R}^{d} , SIAM J. Control Optim., 58 (2020), pp. 85–103, https://doi.org/10.1137/18M 1218704 . · doi ↗
- 5[5] A. Arapostathis, A. Biswas, and V. S. Borkar , Controlled equilibrium selection in stochastically perturbed dynamics , Ann. Probab., 46 (2018), pp. 2749–2799, https://doi.org/10.1214/17-AOP 1238 . · doi ↗
- 6[6] A. Arapostathis, A. Biswas, and D. Ganguly , Certain Liouville properties of eigenfunctions of elliptic operators , Trans. Amer. Math. Soc., 371 (2019), pp. 4377–4409, https://doi.org/10.1090/tran/7694 . · doi ↗
- 7[7] A. Arapostathis, A. Biswas, and S. Saha , Strict monotonicity of principal eigenvalues of elliptic operators in ℝ d superscript ℝ 𝑑 \mathbb{R}^{d} and risk-sensitive control , J. Math. Pures Appl. (9), 124 (2019), pp. 169–219, https://doi.org/10.1016/j.matpur.2018.05.008 . · doi ↗
- 8[8] A. Arapostathis, V. S. Borkar, and M. K. Ghosh , Ergodic control of diffusion processes , vol. 143 of Encyclopedia of Mathematics and its Applications, Cambridge University Press, Cambridge, 2012, https://doi.org/10.1017/CBO 9781139003605 . · doi ↗
