On uniqueness of solutions to viscous HJB equations with a subquadratic nonlinearity in the gradient
Ari Arapostathis, Anup Biswas, Luis Caffarelli

TL;DR
This paper proves the uniqueness of positive solutions to certain viscous Hamilton-Jacobi-Bellman equations with subquadratic gradient nonlinearities under mild growth conditions on the potential function, using a novel measure-based approach.
Contribution
It introduces a new method involving infinite dimensional linear programming for measures, establishing uniqueness without strong regularity assumptions.
Findings
Unique positive solution for a broad class of coercive functions
Method applicable to larger Hamiltonian classes studied by Ichihara
Verification of optimality in ergodic control problems without parabolic equations
Abstract
Uniqueness of positive solutions to viscous Hamilton-Jacobi-Bellman (HJB) equations of the form , with a coercive function and a constant, in the subquadratic case, that is, , appears to be an open problem. Barles and Meireles [Comm. Partial Differential Equations 41 (2016)] show uniqueness in the case that and for some , essentially matching earlier results of Ichihara, who considered more general Hamiltonians but with better regularity for . Without enforcing this assumption, to our knowledge, there are no results on uniqueness in the literature. In this short article, we show that the equation has a unique positive solution for any locally Lipschitz continuous, coercive which satisfies $|D f(x)| \le…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On uniqueness of solutions to viscous HJB equations
with a subquadratic nonlinearity in the gradient
Ari Arapostathis∗
∗ Department of Electrical and Computer Engineering, The University of Texas at Austin, 2501 Speedway, EER 7.824, Austin, TX 78712, USA
,
Anup Biswas*†*
† Department of Mathematics, Indian Institute of Science Education and Research, Dr. Homi Bhabha Road, Pune 411008, India
and
Luis Caffarelli*‡*
*‡*Department of Mathematics, The University of Texas at Austin, 2515 Speedway, RLM 10.150, Austin, TX 78712
Abstract.
Uniqueness of positive solutions to viscous Hamilton–Jacobi–Bellman (HJB) equations of the form , with a coercive function and a constant, in the subquadratic case, that is, , appears to be an open problem. Barles and Meireles [Comm. Partial Differential Equations 41 (2016)] show uniqueness in the case that and for some , essentially matching earlier results of Ichihara, who considered more general Hamiltonians but with better regularity for . Without enforcing this assumption, to our knowledge, there are no results on uniqueness in the literature. In this short article, we show that the equation has a unique positive solution for any locally Lipschitz continuous, coercive which satisfies \bigl{\lvert}Df(x)\bigr{\rvert}\leq\kappa\bigl{(}1+\lvert f(x)\rvert^{2-\nicefrac{{1}}{{\gamma}}}\bigr{)} for some positive constant . Since , this assumption imposes very mild restrictions on the growth of the potential . We also show that this solution fully characterizes optimality for the associated ergodic problem. Our method involves the study of an infinite dimensional linear program for elliptic equations for measures, and is very different from earlier approaches. It also applies to the larger class of Hamiltonians studied by Ichihara, and we show that it is well suited to provide verification of optimality results for the associated ergodic control problems, even in a pathwise sense, and without resorting to the parabolic problem.
Key words and phrases:
viscous Hamilton–Jacobi equations, infinitesimally invariant measures, ergodic control, convex duality
2000 Mathematics Subject Classification:
35J60, 35P30, 35B40, 35B50
1. Introduction
We consider the viscous Hamilton–Jacobi–Bellman (HJB) equation
[TABLE]
for , with . Here, and is coercive. By coercive, sometimes also called inf-compact, we refer to a function whose sublevel sets are compact (or empty) for every . As shown in [Barles-16], Eq. EP has a classical solution for any , where
[TABLE]
This equation which has a long history in the literature, has been studied in [Ichihara-12, Cirant-14] for somewhat more general Hamiltonians, and was recently revisited by Barles in [Barles-16]. What is of interest here, is to characterize the solutions of Eq. EP which are bounded from below, that is, without loss of generality, the positive solutions. Naturally, when we refer to this equation having a unique positive solution, we mean that the solution is unique up to an additive constant. In the superquadratic case (), [Barles-16, Theorem 2.6] shows that Eq. EP has a unique positive solution for any coercive , and in addition, for this solution, . In the subquadratic case (), [Barles-16] adopts assumption (H2) in [Ichihara-12], which states that satisfies a bound of the form
[TABLE]
for some positive constants and for all . Without enforcing this assumption, to our knowledge, there are no results on uniqueness in the literature, and therefore also no verification of optimality results. Note that [Cirant-14] introduces an additional stable drift to study the subquadratic case.
There is substantial literature on viscous HJB equations, other than [Barles-16, Ichihara-12, Cirant-14] mentioned above. It is not our intent to review this literature, since it does not address the problem studied in this paper, but we should at least mention [Barles-01, Ben-Fre, BenNag-91, Barles-10, Fujita-06a, Ichihara-13a, BenFre-92, Lasry-89].
We adopt the following assumption for .
- (A1)
The function is locally Lipschitz continuous and coercive, and there exists a constant such that
[TABLE]
We show that, under (A1), there exists a unique positive solution to Eq. EP. In addition, this solution fully characterizes the ergodic control problem in the sense that a stationary Markov control is optimal if and only if it agrees a.e. on with the function in Eq. 2.19 (see Theorem 4.1). The method we follow covers the more general Hamiltonians studied in [Ichihara-12, Cirant-14], and also improves the existing results for the superquadratic case. This is discussed in Section 3.
1.1. Brief summary of the method
Consider the operator defined by
[TABLE]
Let denote the space of probability measures on the Borel -algebra of , denoted as , endowed with the Prokhorov topology. We say that is infinitesimally invariant for the operator if for all , the latter denoting the functions in with compact support, and denote the set of these probability measures by . Let
[TABLE]
For we use the simple notation
[TABLE]
and define
[TABLE]
In other words, is the subset of consisting of those probability measures under which is integrable. It is simple to show that is always nonempty. Thus, since is coercive, the set
[TABLE]
is compact for all sufficiently large. Clearly, it is also convex.
Consider the minimization problem
[TABLE]
The lower semicontinuity of then implies that the infimum of Eq. LP is attained in . We let denote the set of points in which attain this infimum.
Our approach to the proof of uniqueness of positive solutions of Eq. EP is as follows: First, we show that if is any pair solving Eq. EP, with a positive function, then and some measure taking the form , with and denoting the Dirac mass at , attains the infimum in Eq. LP, that is, it belongs to . Next, we show that is a singleton, thus establishing the uniqueness of a positive solution to Eq. EP.
1.2. Notation
The standard Euclidean norm in is denoted by , and stands for the set of natural numbers. The closure, the boundary and the complement of a set are denoted by , and , respectively. The open ball of radius in , centered at , is denoted by , and is the ball centered at [math]. We use for .
For a Borel space , denotes the set of probability measures on its Borel -algebra, and denotes the Dirac mass at . For and a measurable function which is integrable under , we often use the simplifying notation .
2. Main results
Throughout this section we assume , unless otherwise explicitly mentioned. Also, without loss of generality we assume that , and we scale a solution of Eq. EP, which is bounded from below, by an additive constant so that .
We start with the very useful gradient estimate in [Ichihara-12, Theorem B.1], stated under weaker regularity in [Barles-16, Theorem A.2] for Eq. EP. It appears that the Bernstein approach for this estimate originates in [Lasry-89, Theorem A.1]. We need to use this estimate on small balls, so we scale it as follows.
Corollary 2.1**.**
There exists a constant such that for any solution of Eq. EP we have
[TABLE]
and for all . In particular, under (A1), with perhaps a different constant , we have
[TABLE]
Proof.
Fix any . For , let and f_{r}(y)\coloneqq r^{\gamma^{*}}\bigl{(}f(x+ry)-\lambda\bigr{)}. The function satisfies
[TABLE]
By [Ichihara-12, Theorem B.1], there exists a constant such that any solution of Eq. 2.3 satisfies
[TABLE]
from which Eq. 2.1 follows. ∎
We continue by proving a useful lower bound for positive solutions of Eq. EP. Define
[TABLE]
Lemma 2.1**.**
Assume (A1). Then, for every positive solution of Eq. EP, the following hold.
- (a)
There exist positive constants and such that
[TABLE] 2. (b)
There exists a positive constant such that
[TABLE]
Proof.
Note that by (A1) there exists some such that
[TABLE]
Choose positive and small enough such that . We claim that the assertion in part (a) holds for this . To prove this, we use contradiction. Suppose that
[TABLE]
along some sequence , such that as . We write Eq. EP as
[TABLE]
with . Simplifying the notation, let \Gamma_{n}\equiv\Gamma_{x_{n}}=\bigl{[}f(x_{n})\bigr{]}^{-\frac{1}{\gamma^{*}}}, and define the sequence of scaled functions
[TABLE]
for and . Then we obtain from Eq. EP that
[TABLE]
Integrating Eq. 2.6, we obtain
[TABLE]
Computing also the lower bound inherited from Eq. 2.6 and combining it with Eq. 2.9, we obtain
[TABLE]
This shows that and are bounded in uniformly in . To establish a bound for on , it is enough to show that, for some constant , we have
[TABLE]
By Eq. 2.2 we have
[TABLE]
Thus Eq. 2.11 follows by Eqs. 2.10 and 2.12.
Therefore, since, as we have shown, , and are bounded in uniformly in , then, by using for example [AA-Harnack, Lemma 3.6], we see that equation Eq. 2.8 contradicts the hypothesis in Eq. 2.7 that . This completes the proof of part (a).
Moving to part (b), let be as chosen in the proof of part (a). We have shown above that
[TABLE]
On the other hand, using the estimate Eq. 2.12 on , with and as in part (a), we have
[TABLE]
Therefore, Eq. 2.5 follows by Eqs. 2.4 and 2.13. This completes the proof. ∎
Remark 2.1*.*
The estimate in Lemma 2.1 is not suitable for the superquadratic case. A different scaling can be used when . First, we replace (A1) by
[TABLE]
Then, under Eq. 2.14, we obtain
[TABLE]
for some positive constants and . To prove this, we use \Gamma_{x}=\bigl{[}f(x)\bigr{]}^{\frac{1-\gamma}{3\gamma-2}}, and follow the proof of Lemma 2.1.
To continue, we need the following notation.
Notation 2.1**.**
For , we let be a concave function such that for , and for . Then and are nonnegative, and the latter is supported on . In addition, we select so that
[TABLE]
This is always possible. For example, we can specify as
[TABLE]
Recall the definitions in Eqs. 1.4, 1.5 and LP.
Lemma 2.2**.**
Assume (A1). For any positive solution of Eq. EP with eigenvalue and , we have
[TABLE]
In particular, .
Proof.
Since is coercive by Lemma 2.1 (a), it follows that is compactly supported. Thus we have
[TABLE]
by the definition of . On the other hand, we have
[TABLE]
Therefore,
[TABLE]
for all . By Lemma 2.1 we have for some constant . Using this together with Eq. 2.16, we obtain
[TABLE]
since by the definition of . Thus letting , and applying the monotone convergence theorem, we obtain Eq. 2.17, thus completing the proof. ∎
It is convenient to express the operator in Eq. 1.3 in terms of a family of operators defined by
[TABLE]
It is clear from the Legendre–Fenchel transform
[TABLE]
that, for any solution of Eq. EP, we have
[TABLE]
and that the maximum is realized at .
The next lemma applies to any .
Lemma 2.3**.**
Let . Let be a coercive solution of Eq. EP with eigenvalue . Define
[TABLE]
Then, there exists a Borel probability measure on , such that
[TABLE]
and
[TABLE]
In particular, .
Proof.
Using the definition in Eq. 2.18, we write
[TABLE]
Since is coercive, we can apply [Bogachev-01d, Theorem 1.2] to assert the existence of a unique which satisfies
[TABLE]
Thus Eq. 2.20 follows from Eq. 2.23 and the definition of , whereas Eq. 2.21 follows by integrating Eq. 2.22 with respect to using a cut-off function as in the proof of Lemma 2.2, which shows that for any positive function . ∎
Remark 2.2*.*
Lemma 2.3 can also be established by a simple probabilistic argument. Viewing Eq. 2.20 as a Foster–Lyapunov equation, it is well known that the coercivity of implies that the diffusion with extended generator is positive recurrent. The measure can then be specified as the unique invariant probability measure of this diffusion.
We now discuss some properties of the set of infinitesimally invariant measures which are needed for the proof of Theorem 2.1 below. It is clear that every can be disintegrated into a probability measure and a Borel measurable probability kernel on . We denote this disintegration by . For , define . Then is a Borel measurable map. It is straightforward to verify that is in . Since, by convexity, we have
[TABLE]
it is clear that the infimum in Eq. LP is attained at some whose disintegration results in a kernel which is Dirac for each . Then, can be represented as a Borel measurable map , and vice-versa. We denote the class of such measures as , and abusing the notation we represent them as . Consider such a in . It follows by Eq. 2.17 that . Since , this implies that , and thus has density with respect to the Lebesgue measure by [Bogachev-96, Theorem 1.1].
We continue with our main theorem. Recall the definition of in Eq. 1.1, and that denotes the subset of consisting of points that attain the infimum in Eq. LP.
Theorem 2.1**.**
Assume (A1). The following hold.
- (a)
For any positive solution of Eq. EP with eigenvalue we have
[TABLE]
with as in Lemma 2.3. 2. (b)
The set is a singleton. 3. (c)
There exists a unique positive solution of Eq. EP.
Proof.
By Lemma 2.1 every positive solution of Eq. EP is coercive. Therefore, the first two equalities in part (a) follow by Lemmas 2.2 and 2.3. By [Barles-16, Theorem 2.6], there exists a solution with eigenvalue which is bounded from below. This of course implies thus completing the proof of part (a).
Let and be as in Lemma 2.3. Let be any element of . By the discussion in the paragraph preceding the theorem, has a density with respect to the Lebesgue measure. Let denote the density of , which, as well known, is strictly positive. Let
[TABLE]
It is straightforward to verify that .
By optimality, we have
[TABLE]
by convexity. Thus . Since is strictly positive, (2.24) implies that on the support of . It is clear that if is modified outside the support of , then the modified measure is also infinitesimally invariant for . Therefore . The uniqueness of a probability measure satisfying Eq. 2.23 then implies that , which in turn implies (since on the support of ) that a.e. in . This completes the proof of part (b).
Turning to part (c), existence of a positive solution follows from [Barles-16, Theorem 2.6]. By part (b), for any positive solutions and , we have a.e. in , implying that on . Thus the solution is unique up to an additive constant. This completes the proof. ∎
3. More general Hamiltonians
In this section we consider viscous equations taking the form
[TABLE]
with more general Hamiltonians . We adopt the following assumptions.
- (A2)
The function is in and is coercive. The Hamiltonian satisfies the following.
- (i)
H\in{C}^{2}({\mathds{R}^{d}}\times\bigl{(}{\mathds{R}^{d}}\setminus\{0\})\bigr{)}, and is strictly convex for all . 2. (ii)
There exist constants and , such that
[TABLE]
The hypothesis (A2) is equivalent to (A2′) below for the Lagrangian , which is related to via the Fenchel–Legendre transform, that is,
[TABLE]
- (A2′)
The function is in and is coercive. The Lagrangian satisfies the following.
- (i)
L\in{C}^{2}({\mathds{R}^{d}}\times\bigl{(}{\mathds{R}^{d}}\setminus\{0\})\bigr{)}, and is strictly convex for all . 2. (ii)
There exist constants and , such that
[TABLE]
In addition, under (A2) or (A2′), there exists positive constants and such that
[TABLE]
for all , and
[TABLE]
with equality if and only if or .
The model above is slightly more general than the model in [Ichihara-12, Cirant-14]. A more restrictive assumption on is used in [Ichihara-12], while does not depend on in [Cirant-14]. For the properties mentioned above see [Ichihara-12, Theorem 3.4] and [Cirant-14, Proposition 4.1].
As mentioned earlier, [Ichihara-12] imposes the assumptions in Eq. 1.2 for , for both the subquadratic and superquadratic cases. Barles in [Barles-16] uses Eq. 1.2 only for the subquadratic case, while [Cirant-14] does not consider unbounded for the subquadratic case. Analogous is the model in [Ben-Fre, Section 4.6].
The results for this Hamiltonian are essentially the same as those in Section 2. We need the following ramification of [Ichihara-12, Theorem B.1] analogous to Corollary 2.1 valid for solutions of Eq. 3.1.
Corollary 3.1**.**
Assume (A2). Then, there exists a constant such that any solution of Eq. 3.1 satisfies Eq. 2.1.
Proof.
A closer inspection of the proof of [Ichihara-12, Theorem B.1] reveals that the following is established. Let be a coercive function. There exists a function such that if satisfies
[TABLE]
for a pair of positive constants , then
[TABLE]
We use scaling. With a solution of Eq. 3.1, we define . Using (A2) (ii) and Eq. 3.3, we deduce that and g\equiv r^{\gamma^{*}}\bigl{(}f(x+ry)-\lambda\bigr{)} satisfy Eq. 3.5 for all and for constants and which do not depend on . The result then follows by Eq. 3.6. ∎
For the model in Eq. 3.1, we define
[TABLE]
and and as in Eq. 1.5 and Eq. LP, respectively, relative to in Eq. 3.7. We also let
[TABLE]
Recall that is the set of measures in that attain the infimum in Eq. LP.
Theorem 3.1**.**
- (a)
The conclusions of Lemma 2.1 hold. 2. (b)
For any positive solution of Eq. 3.1 with eigenvalue and , we have
[TABLE]
there exists a Borel probability measure on , such that, with as defined in Eq. 3.8, we have
[TABLE]
and
[TABLE]
In particular, . 3. (c)
We have , with as in Eq. 3.10. 4. (d)
There exists at most one positive solution of Eq. 3.1.
Proof.
Part (a) follows as in Lemma 2.1 with a slight modification. Instead of Eq. 2.8, we use the inequality
[TABLE]
with \xi_{n,u}(y)\,\coloneqq\,\Gamma_{n}\,\xi_{u}\bigl{(}x_{n}+\Gamma_{n}y\bigl{)}, and as in Eq. 3.8. Then we apply Eq. 3.3 and Corollary 3.1, and follow the proof of Lemma 2.1.
For part (b), we define
[TABLE]
and write
[TABLE]
and following the proof of Lemma 2.2, we obtain
[TABLE]
The remaining assertions in (b) follow by Lemma 2.3, with as defined in Eq. 3.8.
Part (c) follows as in Lemma 2.3 with a slight difference. For we don’t know a priori that is integrable under a measure in . So instead of the densities and we use the Radon–Nikodym derivatives.
Part (d) follows from parts (b) and (c). This concludes the proof. ∎
Remark 3.1*.*
Theorem 3.1 does not address existence of a positive solution to Eq. 3.1. For Hamiltonians not depending on , existence is asserted in [Cirant-14]. In general, under some additional assumptions, we can show that there exists a positive solution to Eq. 3.1. In addition to (A1)–(A2), we also assume that for any bounded domain , there exists a constant satisfying the following: for every there exists such that
[TABLE]
This is same as [LP16, (2.23)]. It then follows from [LP16, Theorem 2.15] that there exists a unique and a constant satisfying
[TABLE]
Furthermore, is characterized as follows:
[TABLE]
Thus is monotone decreasing as a function of . Denote by the solution pair of Eq. 3.13 corresponding to , and let . It follows from the equation above that
[TABLE]
which implies that
[TABLE]
Let which exists by the above estimate. It also clear that attains its minimum in a compact set independent of . Thus we can follow a standard argument (see [Barles-16, Theorem 2.6]) to show that as , and
[TABLE]
Assumption (3.12) is satisfied by a large class of Hamiltonian. For instance, consider
[TABLE]
for some bounded function . Then we can choose above. Note that (3.12) follows from the estimate below.
[TABLE]
where the constant depends on and . Now choose .
3.1. Remarks on the superquadratic case
Cirant in [Cirant-14] is adopting (A2), except that in his model the Hamiltonian does not depend on , that is, . He also assumes that and have at most polynomial growth. He shows that there always exists a positive solution to Eq. 3.1, and that this has a at least linear growth.
For this model we can establish that for all , and that therefore, Eq. 3.9 holds. However, the proof of this differs from the proof of Lemma 2.2. We choose instead a smooth concave function such that for , and for , and we scale it by defining for . Since , and has polynomial growth, while has at least linear growth, we can follow the argument in the proof of [ABBK, Theorem 4.1] to conclude that for all . In [Cirant-14], the set of admissible controls are required to satisfy . This is an unnecessary restriction on the class of admissible controls, and can be avoided. Without assuming that is strictly convex, which might result in non-uniqueness for , the approach summarized above, shows that is an optimal Markov control and the corresponding infinitesimal measure is a minimizer of Eq. LP. Thus, we have a strong notion of optimality as explained in Section 4. Under the additional assumption that is strictly convex, the positive solution , and therefore also the optimal Markov control are unique.
4. Implications for the ergodic control problem
The problem Eq. EP is associated with an ergodic control problem for the diffusion given by the Itô stochastic differential equation
[TABLE]
This equation is specified on a complete, filtered probability space \bigl{(}\Omega,{\mathfrak{F}},\operatorname{\mathbb{P}},({\mathfrak{F}}_{t})_{t\geq 0}\bigr{)}, with an -adapted -dimensional Brownian motion. An admissible control is an -valued -progressively measurable process , such that \operatorname{\mathbb{E}}\Bigl{[}\int_{0}^{T}\lvert\xi_{t}\rvert^{\gamma^{*}}\,\mathrm{d}{t}\Bigr{]}<\infty for all , and we let denote the class of such controls. The running cost function is given by in Eq. 1.4.
In [Ichihara-12], optimality is established via the study of the parabolic problem. In view of the optimality results concerning Eq. LP, we can state a stronger version of optimality. We state this result for the model in Eq. EP, noting that an identical argument can be used to establish this for Eq. 3.1 under (A2). We let denote the expectation operator for the diffusion in Eq. 4.1 controlled by with initial condition .
Theorem 4.1**.**
Assume (A1). Let be the unique positive solution of Eq. EP in the subquadratic case, or as in Section 3.1 for the superquadratic case. With as in Eq. 2.19, we have
[TABLE]
Moreover, Eq. 4.2 holds without the expectation operators in the a.s. pathwise sense. In addition a Markov control is optimal, if and only if it agrees with a.e. in .
Proof.
The inequality in Eq. 4.2 follows from the fact that limit points of mean empirical measures in are infinitesimal measures for the operator (see Lemma 3.4.6 and Theorem 3.4.7 in [ABG12]) together with the definition of in Eq. LP. The equality follows by the ergodicity of the process under the control and the fact that is integrable under the invariant probability measure as asserted in Lemma 2.3. The pathwise results also follow from results in [ABG12] referenced above. The verification part of the theorem follows from Theorem 2.1. ∎
Acknowledgements
The work of Ari Arapostathis and Luis Caffarelli was supported in part by the National Science Foundation through grant DMS-1715210. The work of Ari Arapostathis was also supported in part by the Army Research Office through grant W911NF-17-1-001. The research of Anup Biswas was supported in part by an INSPIRE faculty fellowship and DST-SERB grant EMR/2016/004810.
References
