Min-max formulas for nonlocal elliptic operators on Euclidean space

Nestor Guillen; Russell Schwab

arXiv:1812.09642·math.AP·October 18, 2019

Min-max formulas for nonlocal elliptic operators on Euclidean space

Nestor Guillen, Russell Schwab

PDF

TL;DR

This paper revisits the min-max representation of nonlocal elliptic operators satisfying the Global Comparison Property in Euclidean space, clarifying the proof and extending results for operators with translation invariance or spatial regularity.

Contribution

It simplifies the proof of min-max formulas for nonlocal elliptic operators in Euclidean space and introduces new results for translation-invariant and spatially regular operators.

Findings

01

Clarified proof of min-max representation in Euclidean space

02

Extended results to translation-invariant operators

03

Established new properties for spatially regular operators

Abstract

An operator satisfies the Global Comparison Property if anytime a function touches another from above at some point, then the operator preserves the ordering at the point of contact. This is characteristic of degenerate elliptic operators, including nonlocal and nonlinear ones. In previous work, the authors considered such operators in Riemannian manifolds and proved they can be represented by a min-max formula in terms of L\'evy operators. In this note we revisit this theory in the context of Euclidean space. With the intricacies of the general Riemannian setting gone, the ideas behind the original proof of the min-max representation become clearer. Moreover, we prove new results regarding operators that commute with translations or which otherwise enjoy some spatial regularity.

Equations774

u \leq v in R^{d} and u (x) = v (x) \Rightarrow I (u, x) \leq I (v, x) .

u \leq v in R^{d} and u (x) = v (x) \Rightarrow I (u, x) \leq I (v, x) .

I (u, x) = \int_{R^{d}} u (x + y) - u (x) - χ_{B_{1}} (y) \nabla u (x) \cdot y ν (d y),

I (u, x) = \int_{R^{d}} u (x + y) - u (x) - χ_{B_{1}} (y) \nabla u (x) \cdot y ν (d y),

I (u, x) = F (D^{2} u (x), \nabla u (x), u (x)),

I (u, x) = F (D^{2} u (x), \nabla u (x), u (x)),

L (u, x)

L (u, x)

+ \int_{R^{d}} u (x + y) - u (x) - \mathbbm 1_{B_{1} (0)} (y) \nabla u (x) \cdot y μ (x, d y),

with A (x) \geq 0, and x sup \int_{R^{d}} min (∣ y ∣^{2}, 1) μ (x, d y) < \infty.

I (u, x) = a min b max {f_{ab} (x) + L_{ab} (u, x)} .

I (u, x) = a min b max {f_{ab} (x) + L_{ab} (u, x)} .

I (τ_{z} u, x) = I (u, x + z), where τ_{z} u (x) := u (x + z) .

I (τ_{z} u, x) = I (u, x + z), where τ_{z} u (x) := u (x + z) .

∥ I (u) - I (v) ∥_{L^{\infty} (B_{R} (x_{0}))} \leq ρ (R) ∥ u - v ∥_{L^{\infty} (R^{d})} .

∥ I (u) - I (v) ∥_{L^{\infty} (B_{R} (x_{0}))} \leq ρ (R) ∥ u - v ∥_{L^{\infty} (R^{d})} .

∣ I (v + τ_{- z} u, x + z) - I (v, x + z) - (I (v + u, x) - I (v, x)) ∣

∣ I (v + τ_{- z} u, x + z) - I (v, x + z) - (I (v + u, x) - I (v, x)) ∣

\leq ω (∣ z ∣) C (r) (∥ u ∥_{C^{2} (B_{2 r} (x))} + ∥ u ∥_{L^{\infty} (C B_{r} (x))}) .

∥ μ (x + x, \cdot) - μ (x, \cdot) ∥_{T V (C B_{r})} \leq C ω (∣ z ∣) .

∥ μ (x + x, \cdot) - μ (x, \cdot) ∥_{T V (C B_{r})} \leq C ω (∣ z ∣) .

as y \to x, u (y) - u (x) - \nabla u (x) \cdot (y - x) - \frac{1}{2} (y - x) \cdot (D^{2} u (x) (y - x)) \leq o (∣ y - x ∣^{2}) .

as y \to x, u (y) - u (x) - \nabla u (x) \cdot (y - x) - \frac{1}{2} (y - x) \cdot (D^{2} u (x) (y - x)) \leq o (∣ y - x ∣^{2}) .

as y \to x, ∣ u (y) - u (x) - \nabla u (x) \cdot (y - x) ∣ \leq o (∣ y - x ∣),

as y \to x, ∣ u (y) - u (x) - \nabla u (x) \cdot (y - x) ∣ \leq o (∣ y - x ∣),

I (u, x) = v \in C_{b}^{2} (R^{d}) min L \in K (I)_{x} max {I (v, x) + L (u - v)} .

I (u, x) = v \in C_{b}^{2} (R^{d}) min L \in K (I)_{x} max {I (v, x) + L (u - v)} .

L (u) = tr (A_{x} D^{2} u (x)) + B_{x} \cdot \nabla u (x) + C_{x} u (x) + \int_{R^{d}} u (x + y) - u (x) - \mathbbm 1_{B_{1} (0)} (y) \nabla u (x) \cdot y μ_{x} (d y),

L (u) = tr (A_{x} D^{2} u (x)) + B_{x} \cdot \nabla u (x) + C_{x} u (x) + \int_{R^{d}} u (x + y) - u (x) - \mathbbm 1_{B_{1} (0)} (y) \nabla u (x) \cdot y μ_{x} (d y),

∣ A_{x} ∣ + ∣ B_{x} ∣ + ∣ C_{x} ∣ + \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{x} (d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

∣ A_{x} ∣ + ∣ B_{x} ∣ + ∣ C_{x} ∣ + \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{x} (d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

I (u, x) = a min b max {f_{ab} + L_{ab} (u, x)} .

I (u, x) = a min b max {f_{ab} + L_{ab} (u, x)} .

∣ f_{ab} ∣ + ∣ A_{ab} ∣ + ∣ B_{ab} ∣ + ∣ C_{ab} ∣ + \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{ab} (d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

∣ f_{ab} ∣ + ∣ A_{ab} ∣ + ∣ B_{ab} ∣ + ∣ C_{ab} ∣ + \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{ab} (d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

I (u, x) = a min b max {f_{ab} (x) + L_{ab} (u, x)},

I (u, x) = a min b max {f_{ab} (x) + L_{ab} (u, x)},

∥ f_{ab} ∥_{L^{\infty}} + ∥ A_{ab} ∥_{L^{\infty}} + ∥ B_{ab} ∥_{L^{\infty}} + ∥ C_{ab} ∥_{L^{\infty}} + x sup \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{ab} (x, d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

∥ f_{ab} ∥_{L^{\infty}} + ∥ A_{ab} ∥_{L^{\infty}} + ∥ B_{ab} ∥_{L^{\infty}} + ∥ C_{ab} ∥_{L^{\infty}} + x sup \int_{R^{d}} min {1, ∣ y ∣^{2}} μ_{ab} (x, d y) \leq C ∥ I ∥_{Lip, C_{b}^{2} \to C_{b}^{0}} .

∥ μ_{ab} (x_{1}) - μ_{ab} (x_{2}) ∥_{TV (C B_{r})} \leq C (r) ω (2∣ x_{1} - x_{2} ∣),

∥ μ_{ab} (x_{1}) - μ_{ab} (x_{2}) ∥_{TV (C B_{r})} \leq C (r) ω (2∣ x_{1} - x_{2} ∣),

\displaystyle\begin{array}[]{rl}&\text{if}\ \beta=2+\gamma,\ \textnormal{for}\ \gamma\in(0,1),\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{2,\gamma}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=2^{+},\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{2}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=2,\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1,1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1+\gamma,\ \text{for}\ \gamma\in(0,1),\text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1,\gamma}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1^{+},\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1,\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{0,1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=\gamma,\ \text{for}\ \gamma\in(0,1),\text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{0,\gamma}_{b}(\mathbb{R}^{d}).\end{array}

\displaystyle\begin{array}[]{rl}&\text{if}\ \beta=2+\gamma,\ \textnormal{for}\ \gamma\in(0,1),\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{2,\gamma}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=2^{+},\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{2}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=2,\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1,1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1+\gamma,\ \text{for}\ \gamma\in(0,1),\text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1,\gamma}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1^{+},\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=1,\ \text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{0,1}_{b}(\mathbb{R}^{d});\\ &\text{if}\ \beta=\gamma,\ \text{for}\ \gamma\in(0,1),\text{then, we mean}\ C^{\beta}_{b}(\mathbb{R}^{d})=C^{0,\gamma}_{b}(\mathbb{R}^{d}).\end{array}

L_{ab} (u, x)

L_{ab} (u, x)

a, b, x sup \int_{R^{d}} min {1, ∣ y ∣^{β}} μ_{ab} (x, d y) < \infty.

a, b, x sup \int_{R^{d}} min {1, ∣ y ∣^{β}} μ_{ab} (x, d y) < \infty.

I (τ_{- h} u, x + h) - I (0, x + h) = I (u, x) - I (0, x),

I (τ_{- h} u, x + h) - I (0, x + h) = I (u, x) - I (0, x),

I (τ_{- h} u, x + h) = I (u, x),

I (τ_{- h} u, x + h) = I (u, x),

I (u, x) = F (x, u (x), \nabla u (x), D^{2} u (x)),

I (u, x) = F (x, u (x), \nabla u (x), D^{2} u (x)),

L_{β}^{\infty} := {h \in L^{\infty} (R^{d}) ∣ ∣ h (y) ∣ = O (∣ y ∣^{β}) as ∣ y ∣ \to 0},

L_{β}^{\infty} := {h \in L^{\infty} (R^{d}) ∣ ∣ h (y) ∣ = O (∣ y ∣^{β}) as ∣ y ∣ \to 0},

L_{β}^{\infty}

L_{β}^{\infty}

y sup ∣ h (y) ∣ min {1, ∣ y ∣^{β}}^{- 1} .

y sup ∣ h (y) ∣ min {1, ∣ y ∣^{β}}^{- 1} .

F : L_{β}^{\infty} (R^{d}) \times S_{d} \times R^{d} \times R \times R^{d} \to R .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Min-max formulas for nonlocal elliptic operators on Euclidean Space

Nestor Guillen

and

Russell W. Schwab

Department of Mathematics

University of Massachusetts, Amherst

Amherst, MA 01003-9305

[email protected]

Department of Mathematics

Michigan State University

619 Red Cedar Road

East Lansing, MI 48824

[email protected]

(Date: Monday 18th February, 2019 (This is the revised version per referee suggestions.))

Abstract.

An operator satisfies the Global Comparison Property if anytime a function touches another from above at some point, then the operator preserves the ordering at the point of contact. This is characteristic of degenerate elliptic operators, including nonlocal and nonlinear ones. In previous work, the authors considered such operators in Riemannian manifolds and proved they can be represented by a min-max formula in terms of Lévy operators. In this note we revisit this theory in the context of Euclidean space. With the intricacies of the general Riemannian setting gone, the ideas behind the original proof of the min-max representation become clearer. Moreover, we prove new results regarding operators that commute with translations or which otherwise enjoy some spatial regularity.

Key words and phrases:

Global Comparison Principle, Integro-differential operators, Isaacs equation, Whitney extension, Dirichlet-to-Neumann, fully nonlinear equations

2010 Mathematics Subject Classification:

35J99, 35R09, 45K05, 46T99, 47G20, 49L25, 49N70, 60J75, 93E20

The authors gratefully acknowledge partial support from the National Science Foundation while this work was in progress: N. Guillen DMS-1700307 and R. Schwab DMS-1665285. The authors also thank the anonymous referee for some suggestions that we believe improved the presentation of our results.

1. Introduction

A map $I:C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is said to satisfy the Global Comparison Property (GCP) if

[TABLE]

The Laplacian operator, as well as its fractional powers $-(-\Delta)^{\alpha/2}$ ( $\alpha\in(0,2)$ ) all satisfy this property. More generally, given a Lévy measure $\nu(dy)$ (a measure on $\mathbb{R}^{d}\setminus\{0\}$ such that $\min\{1,|y|^{2}\}$ is integrable with respect to $\nu$ ) the operator

[TABLE]

will have the GCP. The GCP is also satisfied by Dirichlet-to-Neumann maps for elliptic equations, generators of Markov processes, Bellman-Isaacs operators in control and differential games, among many examples. When the operator is known a priori to be local, then nonlinear examples of maps with the GCP are of the form,

[TABLE]

where $F:\mathbb{S}_{d}\times\mathbb{R}^{d}\times\mathbb{R}\to\mathbb{R}$ is monotone in its first argument, and Lipschitz continuous in all arguments.

The main contribution of this article is to address when certain operators acting on $C^{2}_{b}(\mathbb{R}^{d})$ must necessarily enjoy a structure similar to those examples above. The canonical object used to address this question will be a linear operator we choose to say is “of Lévy type”: those operators for which there exist functions, $A(x)\in\mathbb{S}_{d}$ , $B(x)\in\mathbb{R}^{d}$ , $C(x)\in\mathbb{R}$ , and measures $\mu(x,dy)$ so that

[TABLE]

We will review some recent results that show for $I:C^{2}_{b}(\mathbb{R}^{d})\to C_{b}(\mathbb{R}^{d})$ that enjoys the GCP, is Lipschitz, and has a natural structural constraint, there exists a family of functions, $f_{ab}$ and linear operators of Lévy type, $L_{ab}$ , so that

[TABLE]

For linear operators, in the 1960’s Courrège [19] showed that all of those that satisfy the GCP must have the form given in (1.2). All of our results here should be considered an extension of Courrège’s result to the nonlinear setting.

In our previous work, [29], we showed such a min-max representation in (1.3). The result in [29] in fact dealt with a more general situation where $I:C^{2}_{b}(M)\to C^{0}_{b}(M)$ where $M$ is a complete Riemannian manifold. We will review the proof of this result in the context of Euclidean space, where many of the arguments simplify greatly. Moreover, we prove two refinements of the main result from [29] relevant to the Euclidean case, one involving translation invariant operators and one for operators that behave continuously with respect to translation operators. Stated informally, our results are the following:

Theorem 1.

An operator $I(u,x)$ that is Lipschitz and satisfies the GCP admits a min-max formula in terms of Lévy type operators.

Theorem 2.

In the previous theorem, assume further that $I(u,x)$ commutes with translations. Then the Lévy operators appearing in the min-max formula all commute with translations.

Theorem 3.

Instead of translation invariance assume that the finite differences of $I(u,x)$ commute with translations up to a certain error depending on a modulus of continuity $\omega(\cdot)$ . Then the Lévy operators appearing in the min-max formula have continuous coefficients with common modulus of continuity of the form $C\omega(2(\cdot))$ .

Theorem 1 above is a special case of the main result in [29], and Theorems 2 and 3 are new.

1.1. Assumptions and main results

Here are our main assumptions.

Assumption 1.1.

The map $I:C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is Lipschitz continuous and has the Global Comparison Property (1.1).

Assumption 1.2.

The map $I:C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is translation invariant. Namely, for any $x,z\in\mathbb{R}^{d}$ and $u\in C^{2}_{b}(\mathbb{R}^{d})$ we have

[TABLE]

Assumption 1.3.

There is a non-increasing function $\rho:(0,\infty)\to\mathbb{R}$ with $\rho(R)\to 0$ as $R\to\infty$ such that if $u,v\in C^{2}_{b}(\mathbb{R}^{d})$ are such that $u\equiv v$ in $B_{2R}(x_{0})$ , then

[TABLE]

Assumption 1.4.

There exists a modulus, $\omega$ , for all $v,u\in C^{2}_{b}(\mathbb{R}^{d})$ , $x,z\in\mathbb{R}^{d}$ , $r>0$ , we have

[TABLE]

It is allowed that $C(r)\to\infty$ as $r\to 0$ ; in some examples $C(r)$ may be bounded and in some it may be unbounded.

The meaning of Assumption 1.1 and Assumption 1.2 is self-evident. Assumption 1.3 seems rather technical, but it will be necessary to obtain compactness for a family of measures arising in the proof (and this assumption is satisfied by a broad family of examples). Note however that this assumption is not needed for the translation invariant case as well as the setting of Theorem 1.9 as these two theorems are obtained with different methods.

Last but not least, Assumption 1.4 can be thought of as a “coefficient regularity” assumption. For instance, in the linear and local case, in which $I$ is a Lévy operator without integral part, Assumption 1.4 is equivalent to the coefficients of the operator having modulus of continuity $C\omega(\cdot)$ for some constant $C>0$ . In fact, Assumption 1.4 is stated so that it indeed linearizes to this usual assumption that one expects in the linear case.

Remark 1.5.

As mentioned above, one can check that for linear operators, Assumption 1.4 is equivalent to the coefficients of the local part being uniformly continuous and the Lévy measures being uniformly continuous in the TV norm along shifts in the base point, i.e.

[TABLE]

By its design, Assumption 1.4 is a technical artifact of our proof, and as such, it is unlikely to be sharp or even the most natural assumption. There is most likely room for improvement here. In fact, one indication of the possibility to make a more natural assumption lies in the fact that even when the original operator, $I$ , is translation invariant (so the most regular dependence on $x$ ), it does not necessarily follow that $I$ also satisfies Assumption 1.4. This also reflects the fact that we have taken a two completely different methods of proof for the results that concern translation invariant operators, and ones that have a modulus with respect to translations.

Remark 1.6.

In Section 6, we give a short list of some operators that fall within the scope of Assumptions 1.1–1.4 and Theorems 1.9–1.14. At the end of Section 6, we give a list of which assumptions each example satisfies.

Remark 1.7.

We note that one subtle improvement of the current work upon our previous one in [29] is that because of a more streamlined proof for the translation invariant case, we were able to establish the non-translation invariant case, Theorem 1.9 (below), without the technical Assumption 1.3. This is purely an artifact of using an approximation scheme in [29] to treat all operators by the same method, and this turns out to have been not essential when one does not want the extra information provided by Theorems 1.11 and 1.14.

The first theorem uses the notion of “pointwise” $C^{2}$ or $C^{1}$ , and so we will define that property here.

Definition 1.8.

For a fixed $x$ we say that $u\in C^{2}(x)$ (“pointwise $C^{2}$ at $x$ ”) if there exists a vector, $\nabla u(x)$ , and a symmetric matrix, $D^{2}u(x)$ , such that

[TABLE]

Similarly if $u$ only enjoys the existence of $\nabla u(x)$ and

[TABLE]

we say that $u\in C^{1}(x)$ (“pointwise $C^{1}$ at $x$ ”).

Now we can restate Theorems 1–3 above, in more precise terms.

Theorem 1.9.

If $I:C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ satisfies Assumption 1.1, then, for each $x$ , there exists a family of linear functionals on $C^{2}(x)$ that depend on $I$ and $x$ , called $\mathcal{K}(I)_{x}$ , so that for all $u\in C^{2}(x)$

[TABLE]

Here, each $L\in\mathcal{K(I)}_{x}$ , has the form

[TABLE]

and for some universal $C$ , the terms also satisfy the bound for all $x$ :

[TABLE]

The proof of Theorem 1.9 appears in Section 3.1, which is at the end of Section 3.

We want to point out to the reader that the notation in Theorem 1.9 is intentional in its use of subscripts for e.g. $A_{x}$ , etc. This is because our construction does not actually produce $L$ as a linear mapping $C^{2}_{b}\to C^{0}_{b}$ , and so it is not correct to think of having a family of $L$ whose coefficients are actually functions of $x$ . Rather, it just says that at each $x$ there is a family functionals that have the desired structure, but it is not clear that they can be put together across all $x$ to make a family of $x$ -dependent operators.

This situation changes under other assumptions, and in the next two theorems, our method produces a family of linear operators mapping $C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ , all of the form (1.2).

Theorem 1.10.

If $I:C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ satisfies Assumption 1.1 and Assumption 1.2 then there exists a family, $\displaystyle\{f_{ab},L_{ab}\}_{a,b\in\mathcal{K}(I)},$ that depends only on $I$ , where for all $a,b$ , $f_{ab}$ are constants, and $L_{ab}$ are linear translation invariant operators mapping $C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ of the form (1.2) (i.e. constant coefficients), and for all $u\in C^{2}_{b}(\mathbb{R}^{d})$ and $x\in\mathbb{R}^{d}$ we have

[TABLE]

Furthermore, for a universal $C$ , for all $f_{ab}$ and $L_{ab}$ ,

[TABLE]

The proof of Theorem 1.10 appears in Section 3.1, which is at the end of Section 3.

Theorem 1.11.

If $I:C^{2}_{b}(\mathbb{R}^{d})\to C_{b}^{0}(\mathbb{R}^{d})$ satisfies Assumption 1.1, Assumption 1.3, and Assumption 1.4, then, there exists a family, $\displaystyle\{f_{ab},L_{ab}\}_{a,b\in\mathcal{K}(I)},$ that depends only on $I$ , where for all $a,b$ , $f_{ab}\in C^{0}_{b}(\mathbb{R}^{d})$ are functions, and $L_{ab}$ are linear operators mapping $C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ of the form (1.2), and for all $u\in C^{2}_{b}(\mathbb{R}^{d})$ , we have

[TABLE]

and for a universal $C$ , for all $f_{ab}$ and $L_{ab}$ ,

[TABLE]

Furthermore, if $\omega$ is as in Assumption 1.4, then the functions $f_{ab},A_{ab},B_{ab},C_{ab},$ all have a modulus of continuity $C\omega(2\cdot)$ , while for each $r>0$ we have the estimate,

[TABLE]

where as above, $C(r)>0$ , is a constant that may possibly (but not necessarily) have the property that $C(r)\to\infty$ as $r\to 0$ .

The proof of Theorem 1.11 appears in Section 5.5, which is at the end of Section 5.

Finally, we give a theorem that reduces the possible terms in the min-max over (1.2). Namely, there are instances in which there may be no second order terms or first order terms. To state this, we abuse notation slightly, and we give a shorthand as $C^{\beta}_{b}(\mathbb{R}^{d})$ to mean the following:

[TABLE]

Definition 1.12.

For a fixed $x$ , we say that $u\in C^{\beta}(x)$ (“pointwise $C^{\beta}(x)$ ”) if the same requirements of Definition 1.8 hold, but the estimate on the right hand side takes into account the different decay as follows:

•

if, $\beta=2+\gamma$ , then $u$ has a second order Taylor expansion and the right hand side is $O(\left|y-x\right|^{2+\gamma})$ ;

•

if, $\beta=2^{+}$ , then $u$ has a second order Taylor expansion and the right hand side is $o(\left|y-x\right|^{2})$ ;

•

if, $\beta=2$ , then we include this in the previous case whenever $u$ has a second order taylor expansion at $x$ ;

•

if, $\beta=1+\gamma$ , then $u$ has a first order Taylor expansion and the right hand side is $O(\left|y-x\right|^{1+\gamma})$ ;

•

if, $\beta=1^{+}$ , then $u$ has a first order Taylor expansion and the right hand side is $o(\left|y-x\right|)$ ;

•

if, $\beta=1$ , then we include this in the previous case whenever $u$ has a first order taylor expansion at $x$ ;

•

if, $\beta=\gamma\in(0,1)$ , then $\left|u(y)-u(x)\right|\leq C\left|y-x\right|^{\gamma}$ .

Assumption 1.13.

All of Assumptions 1.1 – 1.4 hold, but with all instances of $C^{2}_{b}(\mathbb{R}^{d})$ replaced by $C^{\beta}_{b}(\mathbb{R}^{d})$ .

Theorem 1.14.

For each of Theorems 1.9, 1.10, 1.11, we have the following variation: in each case assume that $I$ satisfies Assumption 1.13, for some $\beta\in[0,2^{+}]$ (as enumerated above). Then, taking into account Definition 1.12 for Theorem 1.9, the min-max formula holds in each of the previous results with the following additions: if $\beta<2$ then $A_{ab}=0$ for all $a,b$ , while if $\beta<1$ then $B_{ab}=0$ for all $a,b$ and the operators $L_{ab}$ take the form

[TABLE]

Moreover, the smaller $\beta$ , the more regular the Lévy measures $\mu_{ab}$ are at $y=0$ , namely, we have

[TABLE]

The proof of Theorem 1.14 appears in Section 5.5, which is at the end of Section 5.

Remark 1.15.

In Sections 4 and 5, one can see that at its heart, the fact that the modulus for $I$ is passed onto the coefficient functions in (1.2) is a consequence of our choice to use a Whitney extension in an approximation to $I$ , and the Whitney extension is well known to preserve a modulus of continuity. The actual details are a bit more involved, but that is the main reason. We note the presence of the factor of $2$ in the new modulus is a consequence of the Whitney Extension method; the interested reader can see [54, Chapter VI].

A further comment regarding the assumptions is in order. Suppose that $I$ satisfies Assumption 1.4 with $\omega\equiv 0$ . In this case, taking $v\equiv 0$ the assumption says that

[TABLE]

and if we further assume that $I(0,x)$ is constant (i.e. $I$ applied to the zero function returns a constant), then we have

[TABLE]

that is, $I$ is translation invariant. However, at first sight it is not clear what happens in the reverse direction. That is, we do not know how to show that a translation-invariant operator automatically satisfies Assumption 1.4 with $\omega\equiv 0$ , and in fact we expect that this assumption can be modified so that it seamlessly includes the translation invariant operators as well.

1.2. Notation

For the readers’ convenience, a summary of symbols used in the paper is presented below.

[TABLE]

1.3. Background

There were roughly two reasons that motivated the results we present in this paper. First of all, the link between elliptic equations and a min-max formula for operators has a long history, and it has been exploited extensively in the case of local operators. Until [29], the connection was not known for nonlocal, nonlinear operators. Even so, the link between the two was natural enough that there are at least a few results that assumed a structure like (1.3), including [5], [35], [40], [47], [48], [51], among many others. Thus the theorems here and in [29] give a sort of a posteriori justification to min-max assumptions that appeared in earlier works. Secondly, a formula such as (1.3) can be very useful in connecting results about the integro-differential theory (of which, there has been a large volume recently) with some other pursuits that may not obviously relate to operators such as (1.2). Two recent projects that exploit or were motivated by the min-max formulas are on some Hele-Shaw type free boundary evolutions in [16] and some Neumann homogenization problems [30] [31]. Both of these relate to linear and nonlinear Dirichlet-to-Neumann maps, studied in [26], and there is plenty more to learn about the integro-differential structure in the nonlinear setting. The choice to pursue continuity properties such as the dependence given in (1.5), although a posteriori seems straightforward, was not initially obvious, and it was motivated by recent results about comparison theorems for viscosity solutions of integro-differential equations in [27].

As mentioned earlier, for linear operators, the representation of (1.2) goes back to Courrège [19]. This was naturally connected with generators of Markov processes and boundary excursion processes for reflected diffusions. Hsu [32] provides a similar representation for the Dirichlet to Neumann map for the Laplacian in a smooth domain $\Omega$ , and this corresponds to studying the boundary process for a reflected Brownian motion. If $I$ is not necessarily linear but happens to satisfy the stronger local comparison principle, there are min-max results by many authors, e.g. Evans [21], Souganidis [53], Evans-Souganidis [22] and Katsoulakis [38]. In this case, the operator takes the form,

[TABLE]

which can be expressed as in Theorem 1.9, but with $\mu(x,dh)\equiv 0$ . This was extended to even include the possibility of weak solutions acting as a local semi-group on $BUC(\mathbb{R}^{d})$ , related to image processing, in Alvarez-Guichard-Lions-Morel [1], and to weak solutions of sets satisfying an order preserving set flow by Barles-Souganidis in [6]. In [1] it was shown under quite general assumptions that certain nonlinear semigroups must be represented as the unique viscosity solution to a degenerate parabolic equation.

Although it is still too early to tell, one hopes that theorems like those presented here can create a bridge between some nonlocal equations for which regularity questions arise and the known results about such equations when a min-max structured is known to hold. In the local setting, there are a number of results that leverage the min-max to shed new light on certain issues, and it would be interesting to see if similar things can be done for the nonlocal theory (see the discussion in [29, Section 1] for an incomplete list of such results). The types of regularity results that could find new applications via the min-max theorems here fall into roughly three categories: Krylov-Safonov type results; regularity for translation invariant equations; and Schauder type regularity results. For Krylov-Safonov, this means that solutions of fully nonlinear equations can be shown to enjoy Hölder estimates depending only on the $L^{\infty}$ norm of the solution; some examples are: [9], [14], [15], [37], and [49], among many others. For translation invariant equations, these are the results that show solutions to translation invariant equations very often enjoy $C^{1,\alpha}$ regularity under mild assumptions; some examples are: [9], [17], [41], [44], [50], among others. Finally, for Schauder regularity, we mean results that show that for $x$ -dependent operators, under certain regularity for the coefficients (such as Dini), solutions will have as much regularity as those equations with “constant coefficients”; some examples are: [20], [36], [43], among others. On top of questions of the type of Krylov-Safonov regularity mentioned above, there is another family of regularity results that accompanies existence and uniqueness techniques for viscosity solutions of elliptic partial-differential / integro-differential equations, and it is typically referred to as the Ishii-Lions method, going back to [34]. Both this Ishii-Lions regularity and comparison results could connect well with the operators treated in this paper, as many of the existing works on nonlocal equations assume a min-max. The types of results that could be applicable are like those in [2], [3], [4], [5], and [35], among others.

There is some more discussion of related works and background inside of the examples that we list in Section 6.

1.4. Another description of operators satisfying the GCP

Let us describe an elementary but useful way to view operators satisfying the GCP, which is also related to the min-max representation. First, we introduce a family of functional spaces.

Definition 1.16.

For $\beta\in[0,2^{+}]$ (using the abuse of notation in (1.13)) we define the space $L^{\infty}_{\beta}$ as follows. First, if $\beta\neq 1^{+}$ ,

[TABLE]

while for $\beta=1^{+}$ ,

[TABLE]

(We note the first space requires “Big-O”, while the second space requires “little-o”.) The spaces $L^{\infty}_{\beta}$ are Banach spaces, with norms given by

[TABLE]

Now, suppose we are given a continuous function

[TABLE]

Assume that this function is monotone (non-decreasing) with respect to the first two variables. Then, given $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ define

[TABLE]

where we are using the notation $\delta_{x}u(y):=u(x+y)-u(x)-\nabla u(x)\cdot y\chi_{B_{1}(0)}(y)$ for $\beta\geq 1$ , and $\delta_{x}u(y):=u(x+y)-u(x)$ for $\beta<1$ . It is clear the operator $I$ thus defined has the GCP.

Do all operators with the GCP arise in this form? It is easy to see that the answer is positive, at least when $\beta<2$ . Given $I:C^{\beta}(\mathbb{R}^{d})\to C^{0}(\mathbb{R})$ , with $\beta<2$ , we define a function

[TABLE]

by the formula $F(h,p,u,x):=I(\tau_{-x}h+\tau_{-x}p\cdot(\cdot)\chi_{B_{1}}+u,x)$ . It is straightforward to see that for $F$ so defined and $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ we have

[TABLE]

2. Real valued Lipschitz functions on Banach Spaces

In this section we review various well known facts about Lipschitz functions on Banach spaces, following Clarke’s book [18, Chapter 2]. We will refer most of the proofs to the relevant section in [18]. The section ends with Theorem 2.6 which yields a min-max formula for any real valued, Lipschitz $F$ , such a result is neither new nor surprising, but we present it here in complete detail for the sake of completeness.

We fix a Banach Space, denoted by $X$ , an open convex subset $\mathcal{K}\subset X$ , and a function

[TABLE]

which is assumed Lipschitz with constant $L>0$ , that is

[TABLE]

Definition 2.1.

The upper gradient of $F$ at $x\in\mathcal{K}$ in the direction of $v\in X$ , is defined as

[TABLE]

This can be seen as a function $F^{0}:\mathcal{K}\times X\to\mathbb{R}$ .

Proposition 2.2.

The function $F^{0}(x,v)$ has the following properties

(1)

For any $x\in\mathcal{K},v\in X$ , and $\lambda>0$ we have $F^{0}(x,\lambda v)=\lambda F^{0}(x,v)$ . 2. (2)

For any $x\in\mathcal{K}$ , and $v,w\in X$ we have $|F^{0}(x,v)-F^{0}(x,w)|\leq L\|v-w\|$ . 3. (3)

If $(x_{k},v_{k})\to(x,v)$ then $\limsup F^{0}(x_{k},v_{k})\leq F^{0}(x,v)$ . 4. (4)

$F^{0}(x,-v)=(-F)^{0}(x,v)$ .

Proof.

We refer the reader to [18, Proposition 2.1.1]. ∎

Definition 2.3.

The generalized gradient of $F$ at $x\in\mathcal{K}$ is the subset of $X^{*}$ given by

[TABLE]

We will denote by $\partial F$ the convex hull of the union of $\partial F(x)$ ,

[TABLE]

Proposition 2.4.

The set $\partial F(x)$ , $x\in\mathcal{K}$ , has the following properties

(1)

$\partial F(x)$ * is a non-empty, convex, $\textnormal{weak}^{*}$ -compact subset of $X^{*}$ .* 2. (2)

$\|\ell\|\leq L$ * for every $\ell\in\partial F(x)$ .* 3. (3)

For any $v\in X$ , we have that

[TABLE]

Proof.

We refer the reader to [18, Proposition 2.1.2].

∎

The following theorem, due to Lebourg, is a generalization of the mean value theorem for differentiable functions.

Theorem 2.5 (Lebourg’s Theorem).

Let $x,y$ be points in $\mathcal{K}$ . Then there exist $z$ of the form $z=tx+(1-t)y$ for some $t\in[0,1]$ , such that for some $\ell\in\partial F(z)$

[TABLE]

Proof.

We refer the reader to [18, Theorem 2.3.7].

∎

Using the generalized gradient and Lebourg’s theorem we can easily prove a min-max formula for Lipschitz functionals. Observe this is a general result for Lipschitz functionals in general Banach spaces, and it does not involve anything like GCP (functionals with the GCP on $C^{\beta}_{b}(\mathbb{R}^{d})$ are considered in the next section).

Theorem 2.6.

Let $F:\mathcal{K}\subset X\to\mathbb{R}$ be a Lipschitz function, with $\mathcal{K}$ convex, then for all $x\in\mathcal{K}$ ,

[TABLE]

Proof.

According to Theorem 2.5, given $x,y\in\mathcal{K}$ there is some $\ell\in\partial F$ such that

[TABLE]

In other words, for any $x$ and $y$ in $\mathcal{K}$ we have the inequality

[TABLE]

This also yields an equality for $y=x$ , thus $F(x)=\min\limits_{y\in\mathcal{K}}\max\limits_{\ell\in\partial F}\left\{F(y)+\langle\ell,x-y\rangle\right\}$ .

∎

3. Functionals with the GCP, revisited

Throughout this section $\mathcal{K}$ denotes an open convex set of $C^{\beta}_{b}(\mathbb{R}^{d})$ (see (1.13)). Moreover, for $\rho>0$ , we shall write

[TABLE]

Definition 3.1.

Let $F$ be a map $F:\mathcal{K}\subset C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ and $x\in\mathbb{R}^{d}$ . Such a functional is said to have the Global Comparison Property with respect to $x$ if $F(u)\leq F(v)$ for any pair of functions $u,v\in\mathcal{K}$ such that $u(y)\leq v(y)$ for all $y$ and $u(x)=v(x)$ –we will say in such a case that $v$ touches $u$ from above at $x$ .

The following two auxiliary functions will be useful throughout the section: Fix $\phi_{0}:\mathbb{R}\to\mathbb{R}$ , a nondecreasing $C^{\infty}$ function such that $0\leq\phi_{0}\leq 1$ , $\phi_{0}(x)=0$ for $x\leq 0$ , $\phi_{0}(x)=1$ for $x\geq 1$ . Then, given $r,R>0$ we define the functions

[TABLE]

The following Proposition was first proved in [29, Lemma 4.15, Corollary 4.16], we review the proof here for the reader’s convenience.

Proposition 3.2.

Suppose that $F:\mathcal{K}\subset C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ is a Lipschitz functional which has the $GCP$ with respect to $x$ . Fix $\rho>0$ . There is a constant $C(F,\rho)$ such that given $R>0$ , $r\in(0,1)$ , and $u,v\in\mathcal{K}_{\rho}$ , then

[TABLE]

Remark 3.3.

It is worth comparing Proposition 3.2 with Assumption 1.3. In the latter, one is interested in how $I(u,x)$ depends very little on the values of $u$ far away from $x$ (so, as $r\to\infty$ ), whereas the former deals with a weak version of this property that holds only for $r\in(0,1)$ but which follows alone from the GCP without the need for further assumptions on $F$ .

Proof.

Take $\phi\in C^{2}_{b}(\mathbb{R}^{d})$ , such that $0\leq\phi\leq 1$ and $\phi(x)=0$ . Then, for any $y$ we have

[TABLE]

with the above being an equality for $y=x$ . Now, let $\rho_{0}$ be chosen so that

[TABLE]

Then, let us suppose that $u,v\in\mathcal{K}_{\rho}$ are such that $\|u-v\|_{C^{\beta}_{b}(\mathbb{R}^{d})}\leq\rho_{0}$ . In this case, we have $w\in\mathcal{K}$ since $u\in\mathcal{K}_{\rho}$ and in this case the GCP says that

[TABLE]

Moreover, $F(w)\leq F(v)+L\|w-v\|_{C^{\beta}}$ and $w-v=(1-\phi)(u-v)+\phi\|u-v\|_{L^{\infty}(\textnormal{spt}(\phi))}$ , thus

[TABLE]

Consider the function $\phi(y)=\phi_{r,R}(y-x)$ . Thanks to $r\in(0,1)$ , the following estimates hold

[TABLE]

Substituting these in the inequality for $F(u)-F(v)$ , the desired inequality follows when $\|u-v\|_{C^{\beta}}$ is no larger than $\rho_{0}$ . Otherwise, $\|u-v\|_{C^{\beta}}\geq\rho_{0}$ and iterating the inequality in the previous case one obtains that

[TABLE]

∎

Lemma 3.4.

Let $F:\mathcal{K}\subset C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ be a Lipschitz functional which has the GCP with respect to $x$ . Then, for every $\ell\in\partial F$ we have

[TABLE]

In other words, if $F$ has the GCP with respect to $x$ , then any $\ell$ arising as a generalized gradient of $F$ also has the GCP with respect to $x$ . Furthermore, for any such $\ell$ and $r\in(0,1)$ we have

[TABLE]

Proof.

Let $u\in\mathcal{K}$ , and let $v\in C^{\beta}_{b}(\mathbb{R}^{d})$ be such that

[TABLE]

Then, $u_{t}=u+tv$ touches $u$ from below at $x$ for each small $t$ , therefore $F(u_{t})\leq F(u)$ for every $t$ , and

[TABLE]

Since,

[TABLE]

it follows that $\langle\ell,v\rangle\leq 0$ for any $\ell\in\partial F(u)$ , and the first part of the Lemma is proved. For the second part, one argues similarly, except that instead of invoking the GCP, one applies Proposition 3.2 in order to pass the same estimate for any $\ell\in\partial F$ . ∎

Fix a functional $\ell$ having the GCP with respect to $x$ . Then, define $C_{\ell}$ by

[TABLE]

This associates a constant $C_{\ell}$ to any $\ell$ having the GCP. Likewise, we shall associate a vector $B_{\ell}$ and positive semi-definite matrix $A_{\ell}$ . First, let us introduce some notation,

[TABLE]

Given $\phi,\eta\in\mathcal{S}$ , define the function

[TABLE]

For $x=0$ we will simply write $P_{\phi,\beta,u}$ . Observe that, for example, if $\beta=2$ then $P_{\phi,\eta,u,x}$ is a smooth function which, in a neighborhood of $x$ , coincides with the second order Taylor polynomial of the function $u$ at the point $x$ .

Definition 3.5.

Given any $\phi\in\mathcal{S}$ let $B_{\ell,\phi}$ be the vector defined by

[TABLE]

At the same time, given $\eta\in\mathcal{S}$ let $A_{\ell,\eta}$ be the symmetric matrix defined by

[TABLE]

The following lemmas will characterize all of functionals having the GCP with respect [math] (compare with Courrege’s original proof [19], see also [29]).

Lemma 3.6.

Let $\ell:C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ be a bounded linear functional which has the GCP with respect to [math], and $\phi,\eta\in\mathcal{S}$ (defined in (3.4)). There is a positive measure $\mu_{\ell}$ on $\mathbb{R}^{d}\setminus\{0\}$ with

[TABLE]

such that for any $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ we have the following representation,

[TABLE]

(The notation, $C^{2}(0)$ and $C^{1}(0)$ , appears in Definition 1.8.)

Remark 3.7.

We want to note that the dependence of $\mu$ only on $\ell$ is not a typo. Even though the vector $B_{\ell,\phi}$ and matrix $A_{\ell,\eta}$ clearly depend on the functions $\phi$ and $\eta$ , the reader can see in the proof in (3.9) that $\mu_{\ell}$ does not depend on $\phi$ or $\eta$ .

Proof.

It suffices to prove the representation formula for $u\in C^{2}_{b}(\mathbb{R}^{d})$ (even if $\beta\neq 2$ ), as it trivially extends to all of $C^{\beta}_{b}(\mathbb{R}^{d})$ by approximation. We fix $u\in C^{2}_{b}(\mathbb{R}^{d})\cap C^{2}(0)$ . We recall $P_{\phi,\eta,u}$ is defined in (3.8). Since $P_{\phi,\eta,u}\in C^{\beta}_{b}(\mathbb{R}^{d})$ for each fixed $\phi,\eta$ , we may write

[TABLE]

and linearity gives

[TABLE]

Let us study each of these two terms. Using the definition of $C_{\ell},B_{\ell,\phi},$ and $A_{\ell,\eta}$ , we have for $\beta\geq 2$

[TABLE]

as well as the corresponding expressions in the other cases when $\beta<2$ . Next, we analyze the second term in the expression for $\langle\ell,u\rangle$ above, that is

[TABLE]

First take the case $\beta\neq 1$ . Given $w\in C^{\beta}_{b}(\mathbb{R}^{d})$ , define $\tilde{w}$ by

[TABLE]

Observe that since $\beta\neq 1$ , the function $\tilde{1}=|x|^{\beta}(1+|x|^{\beta})^{-1}$ belongs to $C^{\beta}_{b}(\mathbb{R}^{d})$ . The linear transformation $w\mapsto\tilde{w}$ defines a linear functional $\tilde{\ell}$ via the relation

[TABLE]

This clearly defines a bounded functional on $C^{\beta}_{b}(\mathbb{R}^{d})$ . In fact, however, this functional extends uniquely to a bounded functional in $C^{0}_{b}(\mathbb{R}^{d})$ : since $\tilde{w}$ is touched from above at [math] by the function $\|w\|_{L^{\infty}}\tilde{1}$ , the GCP guarantees that

[TABLE]

This shows $\tilde{\ell}$ is a uniquely defined continuous functional on $C_{b}^{0}(\mathbb{R}^{d})$ whose norm as a functional on $C_{b}^{0}(\mathbb{R}^{d})$ is no larger than $\|\ell\|\|\tfrac{|x|^{\beta}}{1+|x|^{\beta}}\|_{C^{\beta}}$ . It follows there is a measure $\tilde{\mu}$ such that

[TABLE]

Moreover, since $\langle\tilde{\ell},w\rangle\geq 0$ whenever $w\geq 0$ , $\tilde{\mu}(dy)$ is a non-negative measure. Now, since $u\in C^{2}_{b}(\mathbb{R}^{d})$ , we have that the function

[TABLE]

remains continuous as $x\to 0$ , so $w\in C^{0}_{b}(\mathbb{R}^{d})$ and thus $\langle\tilde{\ell},w\rangle$ is well defined. In this case, we have

[TABLE]

and we obtain the formula

[TABLE]

In particular, taking $\mu(dy):=\tfrac{1+|y|^{\beta}}{|y|^{\beta}}\tilde{\mu}(dy)$ , it follows that

[TABLE]

and

[TABLE]

Revisiting the expression of $\ell$ , we have when $\beta\geq 2$

[TABLE]

and the analogous formulas follow for the other cases where $\beta\neq 1$ , per the change in definition of the function $P_{\phi,\eta,u}$ in (3.8). It remains to consider the case $\beta=1$ .

Since $|x|$ is not a $C^{1}$ function, we are going to approximate it by a more regular function. For every small $\varepsilon>0$ we repeat the argument above with $\beta=1+\varepsilon$ and conclude that for some $\mu_{\varepsilon}$ we have the formula

[TABLE]

and this measure $\mu_{\varepsilon}$ is positive and satisfies the bound

[TABLE]

Since

[TABLE]

it follows that the respective finite measures $\{\tilde{\mu}_{\varepsilon}\}_{\varepsilon\in(0,1)}$ have uniformly bounded mass. Therefore, it is not difficult to show (using $\ell$ to get tightness for the $\tilde{\mu}_{\varepsilon}$ ) that along a subsequence $\varepsilon\to 0$ we can find a limit $\tilde{\mu}$ , and if we let $\mu:=(1+|y|)|y|^{-1}\tilde{\mu}$ then

[TABLE]

and again, for any $u\in C^{2}_{b}(\mathbb{R}^{d})$ ,

[TABLE]

∎

We consider the following special functions. For $\delta>0$ , define (see (3.2) for definition of $\psi_{r,R}$ )

[TABLE]

Note that $\phi_{\delta}\equiv 1$ inside $B_{1-2\delta}$ and $\phi_{\delta}\equiv 0$ outside $B_{1-\delta}$ , while $\eta_{\delta}\equiv 1$ inside $B_{\delta}$ and $\eta_{\delta}\equiv 0$ outside $B_{2\delta}$ . Furthermore, we note that $\delta\leq\delta^{\prime}$ implies that $\eta_{\delta}\leq\eta_{\delta^{\prime}}$ .

Lemma 3.8.

Assume that $\beta\in[0,3)$ , $l:C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ is a bounded linear functional with the GCP with respect to [math], and that $A_{\ell,\eta}$ , $B_{\ell,\phi}$ are as in Definition 3.5. Taking $\eta_{\delta}$ as in (3.11), the limit

[TABLE]

exists for all $\beta\in[0,3)$ , and $A_{\ell}\equiv 0$ if $\beta<2$ . Moreover, if $\phi_{\delta}$ is as in (3.10), there is a sequence $\delta_{k}\searrow 0$ such that the following limit exists

[TABLE]

Proof.

Let $\eta_{1},\eta_{2}\in\mathcal{S}$ and such that $\eta_{1}\leq\eta_{2}$ . Then for any positive semi-definite $M$ we have

[TABLE]

Since $\ell$ has the GCP with respect to [math], it follows that

[TABLE]

From this monotonicity and the elementary inequality $|\langle\ell,\tfrac{1}{2}\eta(x)(Mx,x)\rangle|\leq C|M|\max_{ij}\|\eta x_{i}x_{j}\|_{C^{\beta}}$ we conclude that the following limit exists for every positive semi-definite $M$

[TABLE]

At the same time, when $\beta<2$ we have $\|\eta_{\delta}x_{i}x_{j}\|_{C^{\beta}}\to 0$ as $\delta\searrow 0$ for all $i,j$ , so in this case the limit is zero. Now, given a symmetric matrix $M$ , write $M=M^{+}-M^{-}$ , where both $M^{+}$ and $M^{-}$ are positive semi-definite. Then, we also have that the limit

[TABLE]

exists for any symmetric matrix $M$ . It is clear then that this limit is linear as a function of $M$ , and therefore, there is a unique symmetric matrix $A_{\ell}$ such that

[TABLE]

Moreover, this matrix $A_{\ell}$ is positive semi-definite and $A_{\ell,\eta_{\delta}}\to A_{\ell}$ as $\delta\searrow 0$ , and $A_{\ell}=0$ when $\beta<2$ . It remains to analyze the limit of $B_{\ell,\phi_{\delta}}$ along a subsequence. For every $\delta\in(0,1)$

[TABLE]

Now, recall the estimate from Lemma 3.4, which implies

[TABLE]

A direct computation shows that

[TABLE]

It follows that

[TABLE]

and by compactness, there must be a subsequence $\delta_{k}\to 0$ for which $\{B_{\ell,\phi_{\delta_{k}}}\}_{k}$ converges.

∎

Lemma 3.9.

Assume that $\beta\in[0,3)$ . Let $\ell:C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ be a bounded linear functional which has the GCP with respect to [math]. For $\beta\geq 2$ and any $u\in C^{\beta}_{b}(\mathbb{R}^{d})\cap C^{2}(0)$ , we have the representation

[TABLE]

This representation is unique. This means that if there were $\tilde{C}$ , $\tilde{B}$ , $\tilde{A}$ and $\tilde{\mu}$ a measure in $\mathbb{R}^{d}\setminus\{0\}$ all such that

[TABLE]

for all $u$ , then $\tilde{C}=C_{\ell}$ , $\tilde{B}=B_{\ell}$ , $\tilde{A}=A_{\ell}$ , and $\tilde{\mu}=\mu_{\ell}$ . Furthermore, if $\beta<2$ and $u\in C^{\beta}(\mathbb{R}^{d})\cap C^{1}(0)$ , then $A_{\ell}=0$ , and if $\beta<1$ , then $B_{\ell}=0$ and the integrand on the right can be replaced with just $u(y)-u(0)$ .

Proof.

Let $\delta,\delta^{\prime}\in(0,1)$ . Applying Lemma 3.6 with the functions $\phi_{\delta}$ and $\eta_{\delta^{\prime}}$ ,

[TABLE]

Since $\min\{1,|y|^{\beta}\}$ is integrable against $\mu_{\ell}$ , it follows that

[TABLE]

Therefore,

[TABLE]

Then, thanks to Lemma 3.8, the formula for $\langle\ell,u\rangle$ becomes (for every fixed $\delta\in(0,1)$ )

[TABLE]

Now, let $\delta_{k}\searrow 0$ be chosen so that $B_{\ell\phi_{\delta_{k}}}\to B_{\ell}$ (which can be done thanks to Lemma 3.8). From the definition of $\phi_{\delta}$ , we have that

[TABLE]

At the same time, for every $y\in\mathbb{R}^{d}$ we have

[TABLE]

Therefore, by monotone convergence we have

[TABLE]

From where it follows that

[TABLE]

as claimed. It remains to prove the uniqueness part. For this, it is enough to show that if for all $u$ we have $\langle\ell,u\rangle=0$ and

[TABLE]

then $C_{\ell}=0,B_{\ell}=0,A_{\ell}=0$ $\mu_{\ell}=0$ . First, consider any $u$ with compact support which is disjoint from $\{0\}$ , for such a $u$ we have

[TABLE]

Since $u$ can be any function with compact support in $\mathbb{R}^{d}\setminus\{0\}$ , it follows that $\mu_{\ell}=0$ . Evaluating $\ell$ at the function $u(x)\equiv 1$ we obtain $C_{\ell}=0$ . Lastly, evaluating $\ell$ at all of the functions of the form $(x,e)$ , $e\in\mathbb{R}^{d}$ and $(Mx,x)$ , $M$ symmetric matrix, we see that $B_{\ell}\cdot e=0$ for any vector $e$ and $\textnormal{tr}(AM)=0$ for any symmetric matrix $M$ , so that $B_{\ell}=0$ and $A_{\ell}=0$ .

∎

By a simple change of variables, Lemma 3.9 implies the following.

Corollary 3.10.

Assume that $x$ is fixed, $\beta\in[0,3)$ , and let $\ell:C^{\beta}_{b}(\mathbb{R}^{d})\to\mathbb{R}$ be a bounded linear functional which has the GCP with respect to $x$ . For $\beta\geq 2$ any $u\in C^{\beta}_{b}(\mathbb{R}^{d})\cap C^{2}(x)$ we have the representation

[TABLE]

As before, this representation is unique, and when $\beta<2$ and $u\in C^{\beta}_{b}(\mathbb{R}^{d})\cap C^{1}(x)$ , we have $A_{\ell}=0$ , while for $\beta<1$ we have $B_{\ell}=0$ and the integrand can be replaced with just $u(x+y)-u(x)$ .

3.1. Proofs of Theorems 1.9 and 1.10

With Lemmas 3.4 and 3.9 and Corollary 3.10 in hand, we can now prove Theorems 1.10 and 1.9.

Proof of Theorem 1.10.

Consider the functional,

[TABLE]

Now, by Theorem 2.6, we have that

[TABLE]

By Lemma 3.4, each $\ell_{ab}$ is a linear operator having the GCP with respect to [math], in which case Lemma 3.9 says that for $u\in C^{\beta}_{b}(\mathbb{R}^{d})\cap C^{2}(0)$ ,

[TABLE]

The translation invariance of $I$ boils down to the identity

[TABLE]

Therefore,

[TABLE]

However, $\langle\ell_{ab},\tau_{x}u\rangle$ has a simple expression, namely

[TABLE]

and this proves the theorem. ∎

Proof of Theorem 1.9.

The beginning of the proof is similar to that of the previous one. For each $x\in\mathbb{R}^{d}$ , define a functional

[TABLE]

Applying Theorem 2.6, it follows that

[TABLE]

Applying Lemma 3.4, it follows that for any $\ell\in\partial F_{x}$

[TABLE]

Since $F_{x}(v)=I(v,x)$ this proves the Theorem, with $\mathcal{K}(I)_{x}=\{L\mid L(u)=\langle\ell,u\rangle\textnormal{ for }\ell\in\partial F_{x}\}$ .

∎

Remark 3.11.

It is worthwhile to compare the proof of Theorem 1.9 above to the much longer and complicated one given in [29]. The simplicity here is made possible by the use of a mean value theorem for Lipschitz functionals (Theorem 2.5) in the infinite dimensional setting, which suffices to prove Theorem 1.9 as it involves a min-max formula in terms of linear functionals in $C^{2}_{b}$ and not linear operators from $C^{2}_{b}(\mathbb{R}^{d})$ to $C^{0}_{b}(\mathbb{R}^{d})$ . The more complicated method from [29] is however still of value, specially if one is interested in obtaining a min-max representation in terms of a family of linear operators from $C^{2}_{b}$ to $C_{b}^{0}$ . Moreover, it is by adapting the method from [29] that we are able to prove Theorem 1.11, after analyzing the spatial properties of the finite dimensional approximations (see in Section 5).

4. Finite Dimensional Approximations to $C^{\beta}_{b}(\mathbb{R}^{d})$

4.1. Graph approximations

The following nested family of sets will be important in what follows

[TABLE]

It will be convenient to write $h_{n}:=2^{-n}$ . Then, $h_{n}$ represents the maximum possible distance between $x\in\mathbb{R}^{d}$ and $G_{n}$ , and in particular $\textnormal{dist}(x,G_{n})\leq h_{n}$ for all $x\in\mathbb{R}^{d}$ . Observe that

[TABLE]

and note also the union of the sets $G_{n}$ is dense in $\mathbb{R}^{d}$ .

Definition 4.1.

We consider the following function spaces

[TABLE]

These spaces will be related to $C^{\beta}_{b}(\mathbb{R}^{d})$ by restriction, which we think of as a map denoted by $T_{n}$ and given by

[TABLE]

Remark 4.2.

The space $C_{*}(G_{n})$ is a finite dimensional vector space.

4.2. Cube decomposition and partition of unity

In this section we shall apply the Whitney theory to extend functions in a grid $r\mathbb{Z}^{d}$ to all of $\mathbb{R}^{d}$ . Since it is in our interest for the Whitney construction to be compatible with the grid structure, we shall do the usual cube decomposition making sure the resulting family of cubes is invariant under translations by vectors in $r\mathbb{Z}^{d}$ , the resulting construction is illustrated in Figure 1.

Lemma 4.3.

For every $r>0$ , there exists a collection of cubes $\{Q_{k}\}_{k}$ such that

(1)

The cubes $\{Q_{k}\}_{k}$ have pairwise disjoint interiors. 2. (2)

The cubes $\{Q_{k}\}_{k}$ cover $\mathbb{R}^{d}\setminus r\mathbb{Z}^{d}$ 3. (3)

$c_{1}\textnormal{diam}(Q_{k})\leq\textnormal{dist}(Q_{k},\mathbb{Z}^{d})\leq c_{2}\textnormal{diam}(Q_{k}).$ ** 4. (4)

For every $h\in r\mathbb{Z}^{d}$ , there is a bijection $\sigma_{h}:\mathbb{N}\to\mathbb{N}$ such that $Q_{k}+h=Q_{\sigma_{h}k}$ for every $k\in\mathbb{N}$ .

Proof.

We consider the case $r=1$ , once the collection of cubes is $\{Q_{k}\}_{k}$ obtained in this case, the general case follows via scaling by taking the family $\{rQ_{k}\}_{k}$ .

Consider the cube $Q_{0}=[-1/2,1/2]^{d}$ , let $\mathcal{M}_{0}$ denote the family of $2^{d}$ equal size cubes obtained from $Q_{0}$ by bisecting each of its sides. Let $\mathcal{M}_{k}$ denote the family of cubes obtained from applying this same procedure to each of the cubes in $\mathcal{M}_{k-1}$ . Note that the side length of each cube in $\mathcal{M}_{k}$ is just $2^{-k}$ . Now, we construct a family $\mathcal{F}_{0}$ as follows, with $R_{k}:=\{2\sqrt{d}2^{-k}\leq|x|\leq 2\sqrt{d}2^{-(k-1)}\}$ for each $k\in\mathbb{N}$ , then

[TABLE]

Observe that if $Q\in\mathcal{F}_{0}$ then $Q\in\mathcal{M}_{k}$ for some $k$ and there is some $x\in Q$ such that $2\sqrt{d}2^{-k}\leq|x|$ and $|x|\leq 2\sqrt{d}2^{-(k-1)}$ . This means,

[TABLE]

and since $\textnormal{diam}(Q)=\sqrt{d}2^{-k}$ , we conclude that

[TABLE]

On the other hand, we have that

[TABLE]

If $\mathcal{F}$ denotes the subfamily of maximal cubes in $\mathcal{F}_{0}$ , it follows that: the union of these cubes is still $[-1/2,1/2]^{d}\setminus\{0\}$ , the inequality $\textnormal{diam}(Q)\leq\textnormal{dist}(Q,0)\leq 4\textnormal{diam}(Q)$ holds for each $Q\in\mathcal{F}$ , and the cubes have pairwise disjoint interiors.

Denote by $\{Q_{k}\}_{k}$ an enumeration of the family of cubes of the form $Q+z$ , where $Q\in\mathcal{F}$ and $z\in\mathbb{Z}^{d}$ . It is clear that $\{Q_{k}\}_{k}$ covers all of $\mathbb{R}^{d}\setminus\mathbb{Z}^{d}$ and that these cubes have pairwise disjoint interiors. Furthermore, for any $h\in\mathbb{Z}^{d}$ the map $Q\to Q+h$ gives a bijection of the set $\{Q_{k}\}_{k}$ onto itself, therefore one can represent it via a bijection $\sigma_{h}:\mathbb{N}\to\mathbb{N}$ so that $Q_{k}+h=Q_{\sigma_{h}k}$ . Last but not least, as each cube of the form $Q+z$ is closest to $z$ than to any other point in $\mathbb{Z}^{d}$ , property (3) follows from the respectively inequality for the family $\mathcal{F}$ .

∎

Remark 4.4.

We apply Lemma 4.3 with $r=2^{-n}$ , for some $n\in\mathbb{N}$ , and for the rest of the section shall refer to the resulting cubes as $\{Q_{n,k}\}_{k}$ .

Furthermore, for every $n$ and $k$ , we will denote the center of $Q_{n,k}$ by $y_{n,k}$ , and for each $n$ and $k$ we will denote by $\hat{y}_{n,k}$ the unique point in $G_{n}$ such that

[TABLE]

(note that there is only one since by construction not a single center $y_{n,k}$ lies at equidistance to two different lattice points).

In particular, for each of the bijections $\sigma_{h}:\mathbb{N}\to\mathbb{N}$ from Lemma 4.3 we have

[TABLE]

Remark 4.5.

In all what follows, given a cube $Q$ , we shall denote by $Q^{*}$ the cube with same center as $Q$ but whose sides are increased by a factor of $9/8$ . Observe that for every $n$ and $k$ , we have $Q_{n,k}^{*}\subset\mathbb{R}^{d}\setminus 2^{2-n}\mathbb{Z}^{d}$ , and that any given $x$ lies in at most some number $C(d)$ of the cubes $Q_{k}^{*}$ .

Proposition 4.6.

For every $n$ , there is a family of functions $\phi_{n,k}(x)$ such that

(1)

$0\leq\phi_{n,k}(x)\leq 1$ * for every $k$ and $\phi_{n,k}\equiv 0$ outside $Q_{n,k}^{*}$ (using the notation in Remark 4.5)* 2. (2)

$\sum_{k}\phi_{n,k}(x)=1$ * for every $x\in\mathbb{R}^{d}\setminus G_{n}$ .* 3. (3)

There is a constant $C$ , independent of $n$ and $k$ , such that

[TABLE] 4. (4)

For every $z\in G_{n}$ , we have

[TABLE]

where $\sigma_{z}$ are the bijections introduced above.

Proof.

Fix a $C^{\infty}$ function $\phi$ such that

[TABLE]

Let $\ell(Q)$ denote the common length for the sides of $Q_{n,k}$ , and with $y_{n,k}$ as given in Remark 4.4 we define

[TABLE]

Consider the function

[TABLE]

It follows from Remark 4.5 that given any $x$ ,at most $C(d)$ of the terms appearing in the sum are non-zero in a neighborhood of $x$ , and therefore $\Phi$ is a smooth function. Then, define

[TABLE]

It is clear that the functions $\{\phi_{n,k}\}_{k}$ satisfy properties (1) and (2). Property (3) follows easily from the chain rule, using the differentiability of the function $\phi$ . It remains to check property (4), let $z\in G_{n}$ , then

[TABLE]

where we used that $\ell(Q_{n,k})=\ell(Q_{n,\sigma_{z}k})$ , which follows clearly from the definition of $\sigma_{z}$ . ∎

4.3. Discrete derivatives

In what follows, it will be in our interest to approximate the first and second derivatives of a function $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ (see (1.13) for our convention regarding the meaning of $C^{\beta}_{b}$ ) at a point $x\in G_{n}$ using only information about the values of $u$ on $G_{n}$ . This motivates the following two definitions (we recall that $h_{n}=2^{-n}$ ).

Definition 4.7.

The vector $(\nabla_{n})^{1}u(x)$ is defined via the system of equations ( $k=1,\ldots,d$ )

[TABLE]

Definition 4.8.

The matrix $(\nabla_{n})^{2}u(x)$ is defined via the system of equations ( $k,\ell=1,\ldots,d$ ),

[TABLE]

Remark 4.9.

From the definition it is clear that these discrete derivatives commute with translations with respect to a vector $z\in G_{n}$ . That is, given a function $u$ and $z\in G_{n}$ then for every $x\in G_{n}$ we have

[TABLE]

Depending on how regular the function $u$ is, these discrete derivative operators enjoy quantitative “continuity estimates” as functions on $G_{n}$ . An important point being that these estimates are uniform in $n$ once $u$ is fixed.

Proposition 4.10.

There is a universal constant $C$ such that for $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ and $x\in G_{n}$ ,

[TABLE]

Proof.

See appendix.

∎

Proposition 4.11.

Fix $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ . Then, given $x_{1},x_{2}\in G_{n}$ , we have

[TABLE]

Proof.

See appendix.

∎

4.4. The Whitney Extension and Projection operators.

Definition 4.12.

[TABLE]

We are now ready to define the Whitney extension operator.

[TABLE]

The projector operator $\pi_{n}^{\beta}:C^{\beta}_{b}(\mathbb{R}^{d})\to C^{\beta}_{b}(\mathbb{R}^{d})$ is given by

[TABLE]

where we recall that $T_{n}u=u_{\mid G_{n}}$ (Definition 4.1).

Theorem 4.13.

There is a constant $C$ such that for any $n$ and any $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ we have

[TABLE]

Proof.

This follows arguing exactly as in [54, Chapter VI, Theorem 3 and 4], making use of the regularity estimates in Proposition 4.11. Since this is a standard argument, we omit the details. ∎

Proposition 4.14.

Let $z\in G_{n}$ and $u\in C^{\beta}_{b}$ , then.

[TABLE]

Proof.

Let us show that $\pi_{n}^{\beta}(\tau_{z}u)(x)=\tau_{z}\pi_{n}^{\beta}(u)(x)$ for every $x\in\mathbb{R}^{d}$ and $z\in G_{n}$ . Note that if $x\in G_{n}$ then the equality is trivial, so let us take $x\in\mathbb{R}^{d}\setminus G_{n}$ and $z\in G_{n}$ , then we have

[TABLE]

Furthermore, it is not difficult to check that (see Remark 4.9)

[TABLE]

while part (4) of Proposition 4.6 implies that

[TABLE]

From these two identities we conclude that

[TABLE]

where we used that $\sigma_{z}$ is bijective, this proves the proposition. ∎

Remark 4.15.

Given $\varepsilon\in(0,1)$ there is a $C>1$ such that for every $n\in\mathbb{N}$ , $x_{0}\in G_{n}$ , and unit vector $x_{*}\in\mathbb{R}^{d}$ there is some $x_{1}\in G_{n}$ and $s>0$ such that

[TABLE]

Indeed, this follows from the fact that $h_{n}^{\varepsilon}x_{*}\in[-h_{n}^{\varepsilon},h_{n}^{\varepsilon}]^{d}$ and that $[-h_{n}^{\varepsilon},h_{n}^{\varepsilon}]^{d}\cap(G_{n}-x_{0})$ is a $h_{n}$ -net in $[-h_{n}^{\varepsilon},h_{n}^{\varepsilon}]^{d}$ , so there is $x_{1}\in[-h_{n}^{\varepsilon},h_{n}^{\varepsilon}]^{d}\cap(G_{n}-x_{0})$ such that $|h_{n}^{\varepsilon}x_{*}-(x_{1}-x_{0})|\leq h_{n}$ . Then, the inequalities for $|x_{1}-x_{0}|$ follow from two applications of the triangle inequality and the fact that $\varepsilon<1$ and $h_{n}\leq 1/2$ for all $n\geq 1$ .

Proposition 4.16.

Let $w\in C^{\beta}_{b}(\mathbb{R}^{d})$ be such that $w(x)\geq 0$ for every $x\in G_{n}$ and such that $w(x_{0})=0$ at some $x_{0}\in G_{n}$ . Then, there is a universal $C$ such that

[TABLE]

Here, for a given symmetric matrix $D$ , $D_{-}$ denotes it’s negative part.

Proof.

Fix any $x\in G_{n}$ . Thanks to Proposition 4.10 and the fact that $|x-x_{0}|\geq h_{n}$ we have

[TABLE]

Since $w(x_{0})=0$ , and $w(x)\geq 0$ by assumption,

[TABLE]

It is easy to see there is some $x_{1}\in G_{n}$ such that $|x_{0}-x_{1}|=h_{n}$ and

[TABLE]

and therefore,

[TABLE]

Combining these inequalities and recalling Theorem 4.13 it follows that

[TABLE]

This proves the estimate for the gradient when $\beta\geq 1$ . Now assume $\beta\geq 2$ , the beginning of the argument in this case goes along similar lines. For any $x\in G_{n}$ we have that

[TABLE]

where we have once again used Theorem 4.13. Thus, since $w(x_{0})=0$ and $w(x)\geq 0$ for $x\in G_{n}$ ,

[TABLE]

Now, since we are on a lattice, it is obvious that for any $x\in G_{n}$ we have that $x^{\prime}:=2x_{0}-x\in G_{n}$ . In this case we can add up the inequalities for $x$ and $x^{\prime}$ , and conclude that

[TABLE]

Since $x^{\prime}-x_{0}=-(x-x_{0})$ , we conclude that

[TABLE]

Let $x_{*}\in\mathbb{R}^{d}$ be a unit vector such that

[TABLE]

According to Remark 4.15, there is $x_{1}\in G_{n}$ and $s>0$ such that

[TABLE]

For this $x_{1}$ we have

[TABLE]

This, together with the previous step, shows that

[TABLE]

again having used Theorem 4.13. Simplifying, this becomes

[TABLE]

Choosing $\varepsilon=1/2$ , and noting $\min\{3,\beta\}-2)\leq 1$ , we conclude that

[TABLE]

∎

We fix an auxiliary function $\eta_{0}:[0,\infty)\to\mathbb{R}_{+}$ , with $\eta_{0}\in C^{\infty}(\mathbb{R}_{+})$ , and

[TABLE]

The function $\eta_{0}$ , as well as the following two estimates, will be useful in the next section. Essentially, $\eta_{0}(t)$ should be thought of as a smooth replacement for $\min\{1,t\}$ .

Lemma 4.17.

Let $1\leq\beta<\beta_{0}<3$ , and consider $w\in C^{\beta_{0}}_{b}(\mathbb{R}^{d})$ and $x_{0}\in G_{n}$ such that

[TABLE]

Then, there is a function $R_{\beta_{0},n,w,x_{0}}$ such that $R(x_{0})=0$ , and

[TABLE]

for some constant $\gamma=\gamma(\beta,\beta_{0})\in(0,1)$ .

Remark 4.18.

For $\beta\in(0,1)$ , it is straightforward that $w\geq 0$ in $G_{n}$ guarantees that $\pi_{n}^{\beta}w\geq 0$ everywhere, that is, the Whitney extension for $\beta\in(0,1)$ is order preserving. Accordingly, Lemma 4.17 is only needed for $\beta>1$ .

Proof.

We consider the cases $1\leq\beta<2$ and $\beta\geq 2$ separately. First suppose $\beta\in[1,2)$ . Let $\phi_{0}(t)$ be a smooth function such that $0\leq\phi_{0}(t)\leq 1$ for all $t$ , $\phi_{0}(t)=1$ for $t\leq 1/4$ and $\phi_{0}(t)=0$ for $t\geq 1$ . Then set

[TABLE]

For each $x\in\mathbb{R}^{d}$ , let $\hat{x}$ denote a point in $G_{n}$ such that $|x-\hat{x}|=\textnormal{dist}(x,G_{n})\leq h_{n}$ . Then, since $w(\hat{x})\geq 0$ for any $\hat{x}$ (from the assumption), we have

[TABLE]

By Proposition 4.16, we have $|\nabla\pi^{\beta}_{n}w(x_{0})|\leq C\|w\|_{C^{\beta_{0}}}h_{n}$ when $\beta_{0}>1$ , therefore,

[TABLE]

where we have used Theorem 4.13 to bound $\|\pi_{n}^{\beta}w\|_{C^{\beta}_{0}}$ . On the other hand, since $\beta_{0}>1$ and $\nabla\tilde{w}(x_{0})=0$ , we have

[TABLE]

Now, we take $\eta_{0}$ as in (4.5) and define the function

[TABLE]

If $|x-x_{0}|^{\beta_{0}}\geq h_{n}/2$ , then

[TABLE]

If on the contrary, $|x-x_{0}|^{\beta_{0}}\leq h_{n}/2$ , then

[TABLE]

We conclude that

[TABLE]

On the other hand, an elementary computation (see the Appendix) shows that

[TABLE]

Finally, let

[TABLE]

We conclude that $\|R_{\beta_{0},n,w,x_{0}}\|_{C^{\beta}}\leq Ch_{n}^{\gamma}\|w\|_{C^{\beta_{0}}}$ and

[TABLE]

This proves the Proposition when $\beta\in[1,2)$ . The argument for $\beta\geq 2$ is similar, we only highlight the main differences. This time, we subtract not just the first order part of $w$ near $x_{0}$ , but also the second order part, namely we consider the function

[TABLE]

Then, one applies again Proposition 4.16 and use the regularity of $w$ to obtain (in analogy to the previous case)

[TABLE]

The respective function $\tilde{\tilde{R}}$ is defined exactly as $\tilde{R}$ and one argues as in the previous case. ∎

Remark 4.19.

The argument in the proof provides -after small modifications- a closely related result: if instead of $w\in C^{\beta}_{b}(\mathbb{R}^{d})$ we assume that $w\in C^{0}_{b}(\mathbb{R}^{d})$ and that for some $M>0$ and $\beta_{0}>\beta$ we have

[TABLE]

then there is as before a function $\hat{R}_{\beta_{0},n,w,x_{0}}$ such that $\hat{R}_{\beta_{0},n,w,x_{0}}(x_{0})=0$ and $\pi_{n}^{\beta}w(x)+R_{\beta_{0},n,w,x_{0}}(x)\geq 0$ for all $x$ , but this time the $C^{\beta}$ estimate for $\hat{R}_{\beta_{0},n,w,x_{0}}$ is

[TABLE]

The following proposition will be useful later in the proof of Proposition 5.8.

Proposition 4.20.

Let $1\leq\beta<\beta_{0}<3$ or $\beta\in(0,1)$ and $\beta_{0}=\beta$ . Fix $f\in C^{\infty}_{c}(\mathbb{R}^{d})$ , and let $\eta_{0}$ be as in (4.5). Let $x_{0}\in G_{n}$ and $w(x)=f(x-x_{0})\eta_{0}(|x-x_{0}|^{\beta_{0}})$ , then

[TABLE]

for some function $\hat{R}_{\beta_{0},n,w,x_{0}}$ such that $\hat{R}_{\beta_{0},n,w,x_{0}}(x_{0})=0$ and

[TABLE]

where $\gamma$ is as in Lemma 4.17.

Proof.

Define the function $\tilde{w}(x):=(\|f\|_{L^{\infty}}-f(x-x_{0}))\eta_{0}(|x-x_{0}|^{\beta_{0}})$ . Then $\tilde{w}(x_{0})=0$ and

[TABLE]

while, since $\eta_{0}\geq 0$ , we also have $\tilde{w}(x)\geq 0$ for every $x\in G_{n}$ . If $\beta\in[1,2]$ , using Lemma 4.17 and the function $\hat{R}_{\beta_{0},n,w,x_{0}}$ from Remark 4.19, we have

[TABLE]

This inequality, after some rearranging, yields (for $\beta\in[1,2]$ )

[TABLE]

Since we also have $\|\tilde{w}\|_{L^{\infty}}\leq C\|f\|_{L^{\infty}}$ , we have again by Remark 4.19

[TABLE]

and the Proposition is proved in this case. For $\beta\in(0,1)$ we argue along similar lines, using Remark 4.18 instead of Lemma 4.17.

∎

4.5. Convergence of the projection operators

Lemma 4.21.

Let $0<\beta<\beta_{0}<3$ , there is a constant $C$ such that if $u\in C^{\beta_{0}}_{b}(\mathbb{R}^{d})$ , then

[TABLE]

Here, $\gamma=\gamma(\beta_{0},\beta)\in(0,1)$ .

Proof.

For notational simplicity let us write $f(x)=\pi_{n}^{\beta}u(x)$ throughout the proof.

Since $u=f$ throughout $G_{n}$ , for an arbitrary $x\in G_{n}$ we have (with $\hat{x}$ denoting a point in $G_{n}$ such that $\textnormal{dist}(x,G_{n})=|x-\hat{x}|$ ), with $\alpha:=\min\{1,\beta_{0}\}$

[TABLE]

where we made use of Theorem 4.13 to obtain $[f]_{C^{\alpha}}\leq C\|u\|_{C^{\beta}}$ . This shows that $\|u-f\|_{L^{\infty}}$ goes to zero at some rate determined by $\beta_{0}$ and the size of $\|u\|_{C^{\beta_{0}}}$ . To prove the lemma we need to also bound the Hölder seminorm of $u-f$ and its derivatives, according to $\beta_{0}$ .

The case $\beta,\beta_{0}\in[0,1)$ . Fix $x_{1},x_{2}\in\mathbb{R}^{d}$ . First, suppose that $|x_{1}-x_{2}|\leq\max\{|x_{1}-\hat{x}_{1}|,|x_{2}-\hat{x}_{2}|\}$ , then

[TABLE]

In this case, and since $0\leq\beta<\beta_{0}<1$ , we have that $|x_{1}-x_{2}|^{\beta_{0}-\beta}\leq\max\{|x_{1}-\hat{x}_{1}|^{\beta_{0}-\beta},|x_{2}-\hat{x}_{2}|^{\beta_{0}-\beta}\}\leq h_{n}^{\beta_{0}-\beta}$ . Then, using Theorem 4.13

[TABLE]

Next, suppose that $|x_{1}-x_{2}|>\max\{|x_{1}-\hat{x}_{1}|,|x_{2}-\hat{x}_{2}|\}$ . In this case

[TABLE]

where once again Theorem 4.13 was used. Combining these two estimates, we conclude that

[TABLE]

Then, using that $h_{n}\leq 1$ for all $n\geq 1$ , we have

[TABLE]

The case $\beta,\beta_{0}\in[1,2)$ . In this case we trivially have the same estimates from the previous case, and only need the bounds for first derivative. This is done as follows, first

[TABLE]

Then, using Theorem 4.13, we have

[TABLE]

Recall that $\nabla f(\hat{x})=(\nabla_{n})^{1}u(\hat{x})$ , and use Proposition 4.10 to conclude that

[TABLE]

The Hölder seminorm $[\nabla f-\nabla u]_{C^{\beta}}$ is bounded with the same argument used to bound $[f-u]_{C^{\beta}}$ in the previous case, we omit the details.

The case $\beta=2,\beta_{0}\in(2,3)$ . Right as before, we note that

[TABLE]

Then, applying Theorem 4.13 and Proposition 4.10 as in the previous case, we have

[TABLE]

For the Hölder seminorm, we repeat the argument used in the case $\beta\in(0,1)$ , again we leave the details to the reader. ∎

Remark 4.22.

If $u\in C^{0}_{b}(\mathbb{R}^{d})$ , then the same argument from Lemma 4.21 can be used to show

[TABLE]

the rate of convergence being determined by the modulus of continuity of $u$ .

5. Analysis of $I(u,x)$ via the finite dimensional approximations

In this section we introduce a sequence of operators $I_{n}$ which approximate $I$ . The operators $I_{n}$ behave like operators in a finite dimensional vector space in the sense that they arise from a composition between linear maps with a Lipschitz map from a finite dimensional space onto itself. This allows us to prove a min-max formula for $I_{n}(u,x)$ at least when $x\in G_{n}$ by using Clarke’s idea of a generalized gradient [18]. More precisely, we use the fact that $I_{n}$ factorizes via a map between finite dimensional vector spaces (which is what the spaces $C_{*}(G_{n})$ were introduced for), where the generalized gradient can be used, and then lift this to corresponding maps from $C^{\beta}_{b}(\mathbb{R}^{d})$ to $C_{b}^{0}(\mathbb{R}^{d})$ using the Whitney extension. The majority of the section is concerned with deriving estimates and regularity properties for the linear operators arising in the min-max formula for $I_{n}$ , and ultimately concluding such linear operators are pre-compact, which leads to a min-max formula for the original operator.

5.1. The operators $I_{n}$ and their min-max representation

We are going to approximate the operator $I(\cdot,x)$ via “finite dimensional approximations”, this referring to maps $I_{n}:C^{\beta}_{b}\to C^{0}_{b}$ , which factorize through a finite dimensional space (see (5.3) below).

We introduce a modification of the projection operator $\pi_{n}^{0}$ defined in (4.4). First, we define

[TABLE]

That is, given $u\in C(G_{n})$ , we define $\textnormal{Pr}_{n}(u)$ as the function obtained by restricting $u$ to $G_{n}\cap[-2^{n},2^{n}]^{d}$ and then extending it to the rest of $G_{n}$ by zero. Then, we define the modified Whitney extension,

[TABLE]

and the modified projection operator

[TABLE]

These are, respectively, bounded linear maps from $C(G_{n})$ to $C^{\beta}_{b}(\mathbb{R}^{d})$ and from $C^{0}_{b}(\mathbb{R}^{d})$ to $C^{\beta}_{b}(\mathbb{R}^{d})$ . Now we are ready to introduce the finite dimensional approximations to the operator $I$ , define

[TABLE]

That is, to compute $I_{n}(u,x)$ , we first compute the modified projection $\hat{\pi}_{n}^{\beta}u$ , and compute $I(\hat{\pi}_{n}^{\beta}u)$ , to which we later apply the modified projection $\hat{\pi}_{n}^{0}$ . In particular, $I_{n}$ only depends on the values of $u$ on $G_{n}\cap[-2^{n},2^{n}]^{d}$ . Associated to this, we introduce a map, $i_{n}$ , defined as follows

[TABLE]

From the definition of $I_{n}$ , we have $I_{n}=E_{n}^{\beta}\circ\textnormal{Pr}_{n}\circ T_{n}\circ I\circ E_{n}^{\beta}\circ\textnormal{Pr}_{n}\circ T_{n}$ , thus we see $I_{n}$ and $i_{n}$ are themselves related by

[TABLE]

The situation for both $I_{n}$ and $i_{n}$ is represented in the following two diagrams,

[TABLE]

Now, the space $C_{*}(G_{n})$ is finite dimensional (Remark 4.2), and the map $i_{n}:C_{*}(G_{n})\to C_{*}(G_{n})$ is Lipschitz continuous. Therefore, tools available for Lipschitz functions in the finite dimensional setting can be applied to $i_{n}$ and then related to $I_{n}$ via (5.3).

We recall the generalized derivative of $i_{n}$ in the sense of Clarke [18, Section 2.6].

Definition 5.1.

Let $V$ be a Banach space, and $T:V\to V$ a Lipschitz continuous function. We define the set of generalized derivatives of $T$ , by

[TABLE]

By Rademacher’s theorem, the set $\mathcal{D}T$ is not empty when $V$ is finite dimensional. Applying this to $i_{n}:C_{*}(G_{n})\to C_{*}(G_{n})$ , we have, first, that $\mathcal{D}i_{n}$ is non-empty, and secondly that $\mathcal{D}I_{n}$ is non-empty as well, this is proved in Lemma 5.3, where we describe the relationship between $\mathcal{D}i_{n}$ to $\mathcal{D}I_{n}$ . The following Lemma is the mean value theorem for nonsmooth Lipschitz functions between finite dimensional spaces (note the similarity with Theorem 2.5).

Lemma 5.2.

Assume that $I:C^{\beta}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is Lipschitz. For any $u,v\in C_{*}(G_{n})$ , there is a $L\in\mathcal{D}i_{n}$ such that

[TABLE]

Proof.

We refer the reader to [18, Proposition 2.6.5] for a proof of the lemma. ∎

The second lemma is basically the chain rule.

Lemma 5.3.

Assume that $I:C^{\beta}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is Lipschitz. The set $\mathcal{D}I_{n}$ is non-empty, and for any $L\in\mathcal{D}I_{n}$ there is a $\tilde{L}\in\mathcal{D}i_{n}$ such that

[TABLE]

conversely, any $L$ defined in this way for some $\tilde{L}\in\mathcal{D}i_{n}$ belongs to $\mathcal{D}I_{n}$ .

Proof.

Note that $I_{n}$ is differentiable at a point $u$ if and only if $i_{n}$ is differentiable at $\tilde{u}=T_{n}u$ , a fact which follows applying the chain rule to the identities (5.2) and (5.3). Furthermore, at such $u$ ’s we have

[TABLE]

If $u_{k}$ is a sequence along which $I_{n}$ is differentiable, and $L_{k}:=DI_{n}(u_{k})$ converges to some $L$ , then the sequence $\tilde{L}_{k}:=Di_{n}(\tilde{u}_{k})$ has a limit $\tilde{L}$ , and $L=E_{n}^{*}\circ\tilde{L}\circ T_{n}$ , taking the convex hull and by the linearity of $E_{n}^{*}$ and $T_{n}$ , the lemma follows. ∎

The following remark will not be of any relevance until the proof of Theorem 1.11 at the end of this section, but we include it here to illustrate how Lemmas 5.2 and 5.3 immediately yield a min-max formula for $I_{n}(u,x)$ (for $x\in G_{n}$ ).

Remark 5.4.

Fix $n$ and let $x\in G_{n}$ . Then for any $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ we have

[TABLE]

Indeed, according to Lemma 5.2 given $u$ and $v$ says there is some $\tilde{L}\in\mathcal{D}i_{n}$ such that

[TABLE]

In this case, we have $E_{n}^{0}(i_{n}(u))-E_{n}^{0}(i_{n}(v))=E_{n}^{0}(\tilde{L}(u-v))$ , and thus setting $L:=E_{n}^{0}\circ\tilde{L}\circ T_{n}\in\mathcal{D}I_{n}$ , we have

[TABLE]

and (5.4) immediately follows.

Next we make an elementary observation regarding the nature of the operators $L\in\mathcal{D}I_{n}$ . This observation is merely a consequence of the factorization of $I_{n}$ through the space $C(G_{n})$ .

Remark 5.5.

For each $L\in\mathcal{D}I_{n}$ there is a function $K=K_{L}$ , $K:G_{n}\times G_{n}\to\mathbb{R}$ such that

[TABLE]

Indeed, simply let us use the basis functions $\{e_{y}\}_{y\in G_{n}}\subset C(G_{n})$ given by

[TABLE]

Observe that for any $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ the function $T_{n}u$ has finite support, and in particular $T_{n}u=\sum_{y\in G_{n}}u(y)e_{y}$ as the sum on the right has at most a finite number of non-zero terms. Thanks to Lemma 5.3, there is some $\tilde{L}\in\mathcal{D}i_{n}$ such that $L=E^{0}_{n}\circ\tilde{L}\circ T_{n}$ and therefore,

[TABLE]

Then, defining $K_{L}(x,y)=(\tilde{L}e_{x+y})(x)$ for $x,y\in G_{n}$ the identity (5.5) follows.

For the rest of this section we analyze the operators $I_{n}$ and the sets $\mathcal{D}I_{n}$ and obtain in the limit a min-max formula for $I_{n}$ . We shall focus on operators satisfying Assumption 1.4. As we see below this property is inherited –to some extent– by the operators $I_{n}$ , and by any operator $L\in\mathcal{D}I_{n}$ , this fact is covered in the next two propositions. In the subsections that follow, we will use the spatial regularity afforded by Assumption 1.4 to show that the operators in the family $\mathcal{D}I_{n}$ have coefficients enjoying some regularity, which in the limit yields regular coefficients.

Proposition 5.6.

Let $I$ be Lipschitz and satisfy Assumption 1.4. Let $x_{1},x_{2}\in G_{n}$ and $h=x_{1}-x_{2}$ , and $r\geq 2^{4-n}$ . Then, for any $u,v\in C^{\beta}_{b}(\mathbb{R}^{d})$ we have

[TABLE]

where $\omega(\cdot)$ is the modulus of continuity and $C(\cdot)$ the function given by Assumption 1.4.

Proof.

Observe that

[TABLE]

and recall that Proposition 4.14 says that $\pi_{n}^{\beta}(\tau_{-h}u)=\tau_{-h}\pi_{n}^{\beta}(u)$ when $G_{n}+h=G_{n}$ .

Therefore, applying the bound in Assumption 1.4 with $\tfrac{3}{2}r$ ,

[TABLE]

Now, provided $r\geq 2^{4-n}$ , we have

[TABLE]

the proposition follows.

∎

Proposition 5.7.

Let $I$ be Lipschitz and satisfy Assumption 1.4. Given $L\in\mathcal{D}I_{n}$ , $x_{1},x_{2}\in G_{n}$ , $r\geq 2^{4-n}$ and $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ , we have the inequality

[TABLE]

Here, $h=x_{1}-x_{2}$ and $\omega(\cdot)$ and $C(\cdot)$ are given by Assumption 1.4.

Proof.

Consider any $v\in C^{\beta}_{b}(\mathbb{R}^{d})$ such that $I_{n}$ is differentiable at $v$ with derivative $L$ . Then,

[TABLE]

By Proposition 5.6, we have

[TABLE]

This proves the desired inequality for those $L\in\mathcal{D}I_{n}$ which happen to be the derivative of $I_{n}$ at a point of differentiability. This property is clearly preserved under limits and convex combinations, so it follows any $L\in\mathcal{D}I_{n}$ has the desired property. ∎

The following proposition is directly related to Proposition 4.20.

Proposition 5.8.

Assume that $I$ is Lipschitz and satisfies Assumption 1.1. For $f\in C^{\infty}_{c}(\mathbb{R}^{d})$ let $w(x)=f(x-x_{0})\eta_{0}(|x-x_{0}|^{\beta})$ with $\eta_{0}$ as in (4.5), then

[TABLE]

If instead we have $w(x)=f(x-x_{0})\eta_{0}(|x-x_{0}|^{\beta_{0}})$ with $f$ non-negative and some $\beta_{0}>\beta$ , then

[TABLE]

for some constant $\gamma=\gamma(\beta_{0},\beta)\in(0,1)$ .

Proof.

We apply Proposition 4.20, and we have with $\hat{R}_{\beta,n,w,x_{0}}$ from the same proposition, we have

[TABLE]

with equality holding for $x=x_{0}$ . It follows that $\pi_{n}^{\beta}u+\pi_{n}^{\beta}w$ is touched from above at $x_{0}$ by $\pi^{\beta}_{n}u+\hat{w}$ . Then, since $I(\cdot,x)$ has the GCP,

[TABLE]

This means that

[TABLE]

Since $\|\hat{w}\|_{C^{\beta}}=\|f\|_{L^{\infty}}\|\eta_{0}(|\cdot-x_{0}|^{\beta})+\hat{R}_{\beta,n,w,x_{0}}\|_{C^{\beta}}\leq C\|f\|_{L^{\infty}}$ the first inequality is proved. For the second inequality, we apply Remark 4.19 directly, and use that $I$ has the GCP to conclude that

[TABLE]

Then, using the Lipschitz property of $I$ we conclude that

[TABLE]

where we used that $|w(x)|\leq C\|f\|_{L^{\infty}}\min\{1,|x-x_{0}|^{\beta_{0}}\}$ and Remark 4.19 to obtain the last inequality.

∎

Proposition 5.9.

Let $I$ be Lipschitz and satisfy Assumption 1.3. Let $R\geq 1$ and $w\in C^{\beta}_{b}(\mathbb{R}^{d})$ with $w\equiv 0$ in $B_{3R}(x_{0})$ , then for any $x\in\cap B_{R}(x_{0})$ we have

[TABLE]

where $\rho$ is the rate coming from Assumption 1.3.

Proof.

If $w\equiv 0$ in $B_{3R}(x_{0})$ , then $\pi^{\beta}_{n}\equiv 0$ in $B_{2R}(x_{0})$ . In other words, $\pi_{n}^{\beta}u$ and $\pi_{n}^{\beta}u+\pi_{n}^{\beta}w$ are identically equal in $B_{2R}(x_{0})$ . Therefore, Assumption 1.3 says that

[TABLE]

By Proposition 4.20, $\|\pi_{n}^{\beta}w\|_{L^{\infty}(\mathbb{R}^{d})}\leq\|w\|_{L^{\infty}(\mathbb{R}^{d})}$ , the proposition is proved.

∎

5.2. Properties of $\mathcal{D}I_{n}$

For each $L\in\mathcal{D}I_{n}$ and $x\in G_{n}$ we define a Borel measure $\mu_{L}(x,dy)$ (which is possibly signed) as follows

[TABLE]

where $K_{L}(x,y)$ is as in Remark 5.5. From its definition, it is immediate that given $\phi\in C^{\beta}$ and $x\in G_{n}$ then

[TABLE]

Proposition 5.10.

Assume that $I$ is Lipschitz and satisfies Assumption 1.1. For each $L\in\mathcal{D}I_{n}$ and $x\in G_{n}$ , and $\eta_{0}(t)$ the function in (4.5),

[TABLE]

Proof.

Fix $x_{0}\in G_{n}$ . Let us assume first that $\beta\neq 1$ . Let $w(x)=f(x-x_{0})\eta_{0}(|x-x_{0}|^{\beta})$ , then

[TABLE]

Therefore it suffices to show there is a universal constant such that

[TABLE]

Let us prove this when $L$ arises as the derivative of $I_{n}$ at some $v\in C^{\beta}_{b}$ , namely, that

[TABLE]

In this case, we can apply Proposition 5.8 to the expression on the right and conclude that

[TABLE]

where we used that when $\beta\neq 1$ the function $\eta_{0}(|\cdot-x_{0}|^{\beta})$ belongs to $C^{\beta}_{b}(\mathbb{R}^{d})$ and the norm $\|\eta_{0}(|\cdot-x_{0}|^{\beta})\|_{C^{\beta}}$ is bounded in terms of $\beta,d,$ and the function $\eta_{0}$ . This the desired estimate for such $L$ . Since this property is clearly preserved under limits and convex combinations, it follows that the property holds for all elements of $\mathcal{D}I_{n}$ .

The case $\beta=1$ proceeds similarly, except one first fixes $\varepsilon\in(0,1)$ and considers the function $\eta_{0}(|x-x_{0}|^{\beta+\varepsilon})$ instead. After proceeding as in the previous case, we obtain the estimate

[TABLE]

for every $L\in\mathcal{D}I_{n}$ and $x_{0}\in G_{n}$ . The constant $C$ is independent of $\varepsilon\in(0,1)$ , since $\|\eta_{0}(|\cdot-x_{0}|^{\beta})\|_{C^{1}}$ is independent of $\varepsilon$ when $\varepsilon>0$ . Letting $\varepsilon\searrow 0$ for the integral on the left (and using the special form of $\mu_{L}(x_{0},dy)$ ) one obtains the estimate in the case $\beta=1$ .

∎

Proposition 5.11.

Assume that $I$ is Lipschitz and satisfies Assumption 1.1. Let $f\in C^{\infty}_{c}(\mathbb{R}^{d})$ be a non-negative function. There is a constant $C=C(I,d,\beta,\beta_{0})$ such that given $\beta_{0}>\beta$ then for each $L\in\mathcal{D}I_{n}$ and $x\in G_{n}$ ,

[TABLE]

As before, $\eta_{0}$ is the function in (4.5), and $\gamma=\gamma(\beta,\beta_{0})$ .

Proof.

As in the proof of the previous proposition, we note that if $x_{0}\in G_{n}$ , $w(x):=f(x-x_{0})\eta_{0}(|x-x_{0}|^{\beta_{0}})$ , and $L\in\mathcal{D}I_{n}$ , then

[TABLE]

As in the previous Proposition, it suffices to show that $L(w,x_{0})\geq-C\|f\|_{L^{\infty}}h_{n}^{\gamma}$ , and from $\mathcal{D}I_{n}$ ’s definition, it suffices to show this for those $L^{\prime}s$ in $\mathcal{D}I_{n}$ which are the derivative of $I_{n}$ at some $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ . In this case, given that $f\geq 0$ , we may apply the second part of Proposition 5.8 to obtain

[TABLE]

and the proposition is proved. ∎

Let us recall the function

[TABLE]

In this section we introduce a variation on this function. This modification takes into account the geometry of the grid $G_{n}$ as well as the regularity exponent $\beta$ , and will be used in a way analogous to the previous section.

[TABLE]

Associated with this, we introduce functions in $G_{n}$ taking (respectively) scalar, vector, and matrix values.

First, some notation. To functions $\eta,\phi\in\mathcal{S}$ we associate the following family of functions

[TABLE]

Then, for $L\in\mathcal{D}I_{n}$ and $\eta,\phi\in\mathcal{S}$ we define a symmetric matrix $A_{L,\eta}$ , a vector $B_{L,\phi}$ , and a scalar $C_{L}$ . These are functions in $G_{n}$ defined by the formulas,

[TABLE]

The functions $A_{L,\eta},B_{L,\phi},C_{L},$ and $\mu_{L}$ give us a representation for $L(u,x)$ for $x\in G_{n}$ .

Proposition 5.12.

Assume that $I$ is Lipschitz. Let $L\in\mathcal{D}I_{n}$ , then for $\beta\in[2,3)$ and $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ we may write it as

[TABLE]

For $\beta\in[1,2)$

[TABLE]

and for $\beta\in[0,1)$

[TABLE]

Proof.

We do the case $\beta\geq 2$ explicitly, as the others are identical. Let us compute $L(u,x)$ by adding and subtracting $L(P_{\phi,\eta,u,x}^{(n)},x)$ ,

[TABLE]

From Remark 5.5, (5.7), we have that

[TABLE]

As for the other term, we observe that

[TABLE]

Rewriting the terms on the right and gathering the terms, we conclude that

[TABLE]

The remaining cases of $\beta$ follow from the corresponding definition of $P^{(n)}_{\phi,\eta,u}$ in those cases.

∎

The next two propositions say that the terms appearing Proposition 5.12 satisfy a uniform continuity in $G_{n}$ . The first refers to the measure $\mu_{L}$ .

Proposition 5.13.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Let $L\in DI_{n}$ , $x_{1},x_{2}\in G_{n}$ , and $r\geq 2^{4-n}$ . There is a constant $C(r)$ such that for any $\zeta\in C_{c}(\mathbb{R}^{d})$ such that $\zeta\equiv 0$ in $B_{r}$ ,

[TABLE]

where $\omega$ is the modulus from Assumption 1.4. In particular,

[TABLE]

On the other hand, if $\zeta\in C^{0}(\mathbb{R}^{d})$ is such that $\zeta\equiv 0$ in $B_{3R}(0)$ for some $R>1$ , then for any $x_{0}\in G_{n}$ we have

[TABLE]

where $\rho(\cdot)$ is the function from Assumption 1.3.

Proof.

From the fact that $\tau_{-x_{1}}\zeta$ and $\tau_{-x_{2}}\zeta$ vanish in, respectively, $B_{r}(x_{1})$ and $B_{r}(x_{2})$ , we have

[TABLE]

Since $\zeta\equiv 0$ in $B_{r}$ , Proposition 5.7 says that, as long as $r\geq 2^{4-n}$

[TABLE]

This proves the first estimate, for the second one, fix $\zeta$ and $x_{0}\in G_{n}$ , and define $w(x)=\tau_{-x_{0}}\zeta$ , then

[TABLE]

Therefore, as before, it suffices for us to bound $L(w,x_{0})$ for every $L\in\mathcal{D}I_{n}$ , and from the definition of $\mathcal{D}I_{n}$ it suffices to prove the bound for those $L$ such that $L=DI_{n}(v)$ at some $v$ . In this case, Proposition 5.9 says that

[TABLE]

∎

The following notation will be useful in what follows,

[TABLE]

where $C(r)$ is as in Assumption 1.4 (see also Proposition 5.6).

Proposition 5.14.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Let $L\in\mathcal{D}I_{n}$ , $r\geq 2^{4-n}$ , and $x_{1},x_{2}\in G_{n}$ , then

[TABLE]

Proof.

Fix $x_{1},x_{2}\in G_{n}$ and let $h=x_{2}-x_{1}$ . Applying Proposition 5.7 to $x=x_{1}$ and $h$ , with the functions $1$ , $\phi_{i}$ , and $\eta_{ij}$ , we see that for $r\geq 2^{4-n}$

[TABLE]

These inequalities respectively amount to the stated estimate for $A_{L,\eta}$ , $B_{L,\phi}$ , and $C_{L}$ .

∎

5.3. Properties of $\mathcal{D}_{I}$

Now, we define the set $\mathcal{D}_{I}$ , which plays the role the Clarke differential played for $I_{n}$ (we recall that c.h. stands for “convex hull”).

[TABLE]

Remark 5.15.

We would like to note a point about notation and definitions, namely why above we have $\mathcal{D}_{I}$ with $I$ as a subscript. This is to avoid confusion (or perhaps, to promote it) by distinguishing it from the generalized derivative in the sense of Clarke from Definition 5.1. The objects are closely related, and in fact one would hope that $\mathcal{D}_{I}=\mathcal{D}I$ , but we are not concerned with whether this is actually the case as the above definition works for our purposes.

The following is an important Lemma that says –among other things– that $\mathcal{D}_{I}$ is non-empty.

Lemma 5.16.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Given a sequence $n_{k}\to\infty$ and operators $L_{n_{k}}$ with $L_{n_{k}}\in\mathcal{D}I_{n_{k}}$ for every $k$ , and $\phi,\eta\in\mathcal{S}$ we have the following

(1)

There is a subsequence $\bar{n}_{k}$ and functions $A(x),B(x),$ and $C(x)$ defined on $\mathbb{R}^{d}$ and taking values respectively in $\mathbb{S}(d)$ , $\mathbb{R}^{d}$ , and $\mathbb{R}$ , such that if $x\in G_{n}$ for some $n$ then we have the convergence

[TABLE] 2. (2)

There is a function $\mu(x)$ in $\mathbb{R}^{d}$ , taking values on the space of Lévy measures in $\mathbb{R}^{d}$ , such that for every $r>0$ , and every $x$ as before we have the convergence

[TABLE] 3. (3)

The functions $A,B,C,$ all have a modulus of continuity $C\omega(2(\cdot))$ , while for each $r>0$ we have the estimate,

[TABLE] 4. (4)

If we define $L$ by

[TABLE]

Then, $L\in\mathcal{D}_{I}$ . 5. (5)

Moreover, if $\beta<2$ , then we have $A(x)\equiv 0$ . Furthermore, if $\beta<1$ then $B(x)\equiv 0$ and $L$ takes the form

[TABLE]

Proof.

Let us fixe $\eta$ and $\phi$ . First of all, we invoke Proposition 5.12 to obtain the collection of $A_{L_{n_{k}},\eta}$ , $B_{L_{n_{k}},\phi}$ , $C_{L_{n_{k}}}$ , and $\mu_{L_{n_{k}}}$ . Furthermore, already as a result of Proposition 5.12, we have item (5) of the lemma.

Step 1. (Extension) We have a sequence of functions defined on varying, monotone increasing sets $G_{n}$ . One way to show they converge (along a subsequence) to a function in $\mathbb{R}^{d}$ is by extending them to all of $\mathbb{R}^{d}$ and check whether the resulting sequences are pre-compact.

With this idea in mind, for each $n\in\mathbb{N}$ we apply the Whitney extension to $A_{L_{n},\eta}$ , $B_{L_{n},\eta}$ , $C_{L_{n},\eta}$ ,

[TABLE]

We repeat the same for $\mu_{L_{n}}$ , resulting in a map $\hat{\mu}_{L_{n}}$ from $\mathbb{R}^{d}$ to the space of Lévy measures, given by the formula

[TABLE]

where $\{\phi_{k}\}_{k}$ is the partition of unity from Proposition 4.6. The functions $\hat{A}_{L_{n},\eta}$ , $\hat{B}_{L_{n},\phi}$ , and $\hat{C}_{L_{n}}(x)$ all have modulus of continuity $C\omega(2(\cdot))$ , thanks to Proposition 5.14 and the properties of the Whitney extension operator, see [54, Chapter VI, Theorem 3]. The same proof from reference [54] can be applied with minor modifications to show that for every $r>0$ we have

[TABLE]

Furthermore, for every $x$ , by Proposition 5.13,

[TABLE]

where $\rho(R)\to 0$ as $R\to\infty$ . This shows that for each $r>0$ , the functions $\{\hat{\mu}_{L_{n}}\mid_{\mathcal{C}B_{r}}\}_{n}$ are an equicontinuous family of functions taking values inside the space of measures $\nu$ which are supported in $\mathcal{C}B_{r}$ and such that $\nu(\mathcal{C}B_{R})\leq\rho(R)$ for all $R\geq r$ . This space, equipped with the total variation distance, is a compact metric space.

Step 2. (Cantor diagonalization) We now use a standard Cantor diagonalization argument to obtain locally uniform convergence along a subsequence. We construct a family nested sequences $\tilde{n}^{m}_{k}$ in the following recursive manner. First, $\tilde{n}^{1}_{k}$ is a subsequence of $n_{k}$ along which the functions converge uniformly in $B_{1}$ to functions $A^{1}(x),B^{1}(x)$ , and $C^{1}(x)$ ) defined in $B_{1}$ . Next, suppose that for $m\in\mathbb{N}$ we have build a nested family of sequences $\tilde{n}^{1}_{k},\ldots,\tilde{n}^{m}_{k}$ such that the functions $A_{L_{\tilde{n}^{m}_{k}},\eta},\ldots$ , etc converge uniformly in $B_{m}(0)$ to functions $A^{m}(x)\ldots$ , etc. In this case, we choose $\tilde{n}^{m+1}_{k}$ to be a subsequence of $\tilde{n}^{m}_{k}$ along which $A_{L_{\tilde{n}^{m+1}_{k}},\eta},\ldots$ converge uniformly in $B_{m+1}$ to functions $A^{m+1}(x)\ldots$ and so on.

Having constructed these $\tilde{n}^{m}_{k}$ , we define the sequence $\tilde{n}_{k}$ as $\tilde{n}_{k}:=n^{k}_{k}$ . The resulting sequences converge locally uniformly, respectively, to $A(x),B(x)$ , and $C(x)$ .

Step 3. (Cantor diagonalization continued)

As noted at the end of Step 1, for every $r>0$ , the sequence $\{\hat{\mu}_{L_{\tilde{n}_{k}}}\}_{k}$ is an equicontinuous family of functions taking values in a compact metric space. Therefore, we can apply the Arzela-Ascoli type theorem found in [24, p. 202] to obtain a subsequence $\bar{n}^{1}_{k}$ of $\tilde{n}_{k}$ and a measure $\mu^{1}$ such that

[TABLE]

Now, suppose we have repeated this $m$ times: we have $\bar{n}^{m}_{k}$ (a subsequence of $\bar{n}^{m-1}_{k}$ ), as well as a measure $\mu^{m}$ such that

[TABLE]

Then, using again the compactness theorem in [24, p. 202] we pick a subsequence $\bar{n}^{m+1}_{k}$ of $\bar{n}^{m}_{k}$ and a measure $\mu^{m+1}$ such that

[TABLE]

Observe that the measures $\{\mu^{m}\}$ are such that $\mu^{m+1}_{\mid\mathcal{C}B_{1/2^{m}}}(x)=\mu^{m}(x)$ for all $x\in B_{m}$ , which uniquely defines a direct limit measure $\mu(x)$ for each $x\in\mathbb{R}^{d}\setminus\{0\}$ . Letting $\bar{n}_{k}:=\bar{n}_{k}^{k}$ we see that for every $R>0$ and $r>0$ we have

[TABLE]

Since $\bar{n}_{k}$ is a subsequence of $\tilde{n}_{k}$ , we still have convergence of $A_{L_{\bar{n}_{k}},\eta},\ldots$ to $A(x),\ldots$ . Moreover, the continuity estimates in the previous step all pass to the limit to give respective estimates for $A(x),B(x),C(x),$ and $\mu(x)$ in the respective metrics.

Last but not least, we note that while $\{\mu_{L_{\bar{n}_{k}}}\}_{k}$ are a sequence of signed measures, their limit $\mu$ will be a measure, which follows at once from Proposition 5.11.

Step 4. (Convergence)

First, note that for fixed $u$ , we have that as $n\to\infty$ ,

[TABLE]

which in particular guarantees that, for every fixed $r>0$ ,

[TABLE]

Then, by the bound in Proposition 5.10, we conclude that

[TABLE]

Therefore, and taking into account the convergence of $\hat{A}_{L_{\tilde{n}_{k}},\eta},\hat{B}_{L_{\tilde{n}_{k}},\phi},$ and $\hat{C}_{L_{\tilde{n}_{k}}}$ , and with $L(u,x)$ defined as in the statement of the Lemma, $x\in G_{n}$ , and $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ , we have

[TABLE]

and we conclude that $L\in\mathcal{D}_{I}$ .

∎

It is to be expected that every $L\in\mathcal{D}_{I}$ satisfies the GCP, and thus, it has to be an operator of Lévy type. This is proved in the lemma below, and further, we show that the coefficients in the operator inherit a modulus of continuity from Assumption 1.4.

Lemma 5.17.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Given $L\in\mathcal{D}_{I}$ , and any $\phi,\eta\in\mathcal{S}$ , the operator $L$ can be represented as

[TABLE]

Here, $\mu_{L}(x,dy)$ is a Lévy measure satisfying the continuity estimate (5.12), and

[TABLE]

all have modulus of continuity $C\omega(2(\cdot))$ .

Proof.

Fix $\phi,\eta\in\mathcal{S}$ . Assume first that $L$ is the limit of a sequence $L_{n_{k}}$ with $L_{n_{k}}\in\mathcal{D}I_{n_{k}}$ . Then, by Lemma 5.16 there is a subsequence $\tilde{n}_{k}$ as well as (matrix, vector, scalar, measure)-valued functions $A,B,C$ , and $\mu$ , all such that

[TABLE]

and, as a result, we have

[TABLE]

The estimate in Proposition 5.10 in the limit as $n\to\infty$ implies that

[TABLE]

for some constant $C$ independent of $x$ and $L$ . Meanwhile, also the $n\to\infty$ limit of the estimate in Proposition 5.11 implies that $\mu(x,dy)$ is a non-negative measure in $\mathbb{R}^{d}\setminus\{0\}$ . The positivity of $\mu$ means that the previous estimate is equivalent to

[TABLE]

Since $L_{\tilde{n}_{k}}(u,x)\to L(u,x)$ , for every $u$ , we have in particular, for $x\in\bigcup G_{k}$

[TABLE]

From where it follows that $(A_{L,\eta})_{ij}(x)=L(\tau_{-x}\eta_{ij},x)$ (and thus for all $x$ , by continuity), the exact same argument yields that $(B_{L,\phi})_{i}(x)=L(\tau_{-x}\phi_{i},x)$ , and $C_{L}(x)=L(1,x)$ , and the lemma is proved.

∎

Let us now simplify things by doing away with the auxiliary functions $\phi$ and $\eta$ . To accomplish this, we shall make use of the auxiliary functions from Section 3.

[TABLE]

where we recall the two-parameter of functions $\psi_{r,R}(x)$ was defined in (3.2). An important property of these one-parameter families is the bound

[TABLE]

Corollary 5.18.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Then, any $L\in\mathcal{D}_{I}$ has the form,

[TABLE]

Moreover, $A,B,$ and $C$ each have modulus of continuity $C\omega(2(\cdot))$ , and for every $r>0$ and any $x_{1},x_{2}\in\mathbb{R}^{d}$ we have

[TABLE]

If $\beta<2$ , then $A\equiv 0$ , while if $\beta<1$ then $B\equiv 0$ and the integrand with respect to $\mu(x,dy)$ in the formula above is replaced with $u(x+y)-u(x)$ .

Proof.

Take a decreasing sequence $\delta_{k}$ such that $\delta_{k}\to 0$ , and let us take the functions $\phi_{\delta_{k}}$ and $\eta_{\delta_{k}}$ , as defined in (5.13). Then for each $k$ , $L$ has the representation

[TABLE]

where $A_{L,\eta_{\delta_{k}}}$ , $B_{L,\phi_{\delta_{k}}},$ and $C_{L}$ are as in Lemma 5.17. Now, $L$ satisfies the estimate

[TABLE]

Thanks to (5.14), it follows that $\alpha(1,\eta_{\delta_{k}})\leq C$ for all $k$ . It follows that $\{A_{L,\eta_{\delta_{k}}}\}_{k}$ has a uniform modulus of continuity. The same argument yields a modulus of continuity for $\{B_{L,\phi_{\delta_{k}}}\}_{k}$ and for the function $C(x)$ , all given by $C\omega(2|x_{1}-x_{2}|)$ , with $C$ independent of $k$ and $\omega$ being the modulus from Assumption 1.4. This equicontinuity means these sequences of functions are pre-compact at least when restricted to any compact subset of $\mathbb{R}^{d}$ , by the Arzela-Ascoli theorem. Therefore, after a Cantor diagonalization argument we see that along some subsequence $m_{k}\to\infty$ these functions converge locally uniformly in $\mathbb{R}^{d}$ to functions $A(x)$ , $B(x)$ , respectively. Of course, the functions $A,B,$ and $C$ all inherit the modulus of continuity $C\omega(2(\cdot))$ . The respective TV-norm continuity estimate for $\mu_{L}$ follows by applying Proposition 5.13 and passing to the limit (always recalling that, $\mathcal{D}_{I}$ is the convex hull of such limit points).

With the convergence established, we have

[TABLE]

and so, for every $u$ we have the formula

[TABLE]

It remains to compute the limit of the integral, observe that

[TABLE]

which means that

[TABLE]

Therefore,

[TABLE]

On the other hand, for every $y$ we have

[TABLE]

and the limit is monotone. Therefore, by monotone convergence we conclude that

[TABLE]

and with this the Corollary is proved. ∎

5.4. Limits of $I_{n}$

Lemma 5.19.

Assume that $I:C^{\beta}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ is Lipschitz. Let $K>0$ and $0<\beta<\beta_{0}<3$ . If $u\in C^{\beta_{0}}_{b}(\mathbb{R}^{d})$ is supported in $B_{K}$ , and $2^{n-2}\geq K$ , then

[TABLE]

for a universal constant $C$ and $\gamma=\gamma(\beta_{0},\beta)\in(0,1)$ . Furthermore, we have

[TABLE]

Proof.

Let $u$ be compactly supported in $B_{K}$ , and be such that $\|u\|_{C^{\beta_{0}}}\leq M$ . First, note that since $2^{n-2}\geq K$ , then we have

[TABLE]

thus, $I_{n}(u)=\hat{\pi}_{n}^{0}\circ I\circ\pi_{n}^{\beta}(u)$ . Keeping this in mind, using the Lipschitz property of $I$ , we have

[TABLE]

Since $2^{n-2}\geq K$ we have that $I(\hat{\pi}_{n}^{\beta}u)=\hat{\pi}_{n}^{0}I(\hat{\pi}_{n}^{\beta}u)=I_{n}(u)$ when restricted to $B_{K}\cap G_{n}$ , which thanks to Lemma 4.21 implies the first estimate. Next, Theorem 4.13 guarantees that

[TABLE]

Thus,

[TABLE]

Applying Lemma 4.21 to the first term and Remark 4.22 to the second, we conclude that

[TABLE]

∎

Corollary 5.20.

Assume $I$ satisfies Assumptions 1.1, 1.3, and 1.4, as stated for $C^{\beta}_{b}(\mathbb{R}^{d})$ . Then for every $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ and every $R>0$ ,

[TABLE]

Proof.

Fix $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ and $R,\varepsilon>0$ . For $K>0$ (to be determined later), we may decompose $u$ as $u=u_{0}+u_{1}$ , where $u_{0}$ is compactly supported in $B_{2K+1}$ and $u_{1}\equiv 0$ in $B_{2K}$ , all such that

[TABLE]

The constant $C>1$ being independent of $K$ . Now, by Assumption 1.3 and since $u\equiv u_{0}$ in $B_{2K}$ , we have

[TABLE]

Choose $K$ large enough so that $K\geq 2R$ and $2C\rho(R)\|u\|_{C^{\beta}(\mathbb{R}^{d})}\leq\varepsilon/2$ . Then, with this $K$ , we apply Lemma 5.19 two times, and conclude that there is some $n_{0}>0$ such that

[TABLE]

On the other hand, in all $\mathbb{R}^{d}$ we have the pointwise inequality,

[TABLE]

and it follows that, for $x\in B_{R}$ and $n\geq n_{0}$ , that

[TABLE]

and the corollary is proved.

∎

5.5. Proofs of Theorems 1.11 and 1.14

We conclude this section with the proofs of the remaining theorems.

Proof of Theorem 1.11.

Consider the set $\mathcal{D}_{I}$ . The proof will boil down to showing that for any $u,v\in C^{\beta_{0}}_{c}(\mathbb{R}^{d})$ and any $x\in\mathbb{R}^{d}$ there is some $L\in\mathcal{D}_{I}$ such that

[TABLE]

Fix $u,v$ and $x$ . Then, by Remark 5.4, for every $n$ we have

[TABLE]

In particular, for every $n$ , there is some $L_{n}\in\mathcal{D}I_{n}$ such that (with this same $u,v$ and $x$ )

[TABLE]

Let us obtain an inequality as we let $n\to\infty$ along some subsequence. Thanks to Corollary 5.20, for every $x\in\mathbb{R}^{d}$ we have

[TABLE]

On the other hand, Lemma 5.16 says there is a subsequence $n_{k}$ and an operator $L$ such that $L_{n_{k}}(u-v,x)$ converges to $L(u-v,x)$ , and moreover $L\in\mathcal{D}_{I}$ , by the definition of $\mathcal{D}_{I}$ . Then, we conclude that

[TABLE]

The above holds for any pair of functions $u$ and $v$ and any point $x\in\mathbb{R}^{d}$ . Taking the minimum over all $v$ , we obtain for any $u$ and $x$ ,

[TABLE]

Using $v\in C^{\beta}_{b}(\mathbb{R}^{d})$ and $L\in\mathcal{D}_{I}$ as the set of labels, which we rename $ab$ , and letting $f_{ab}(x)$ correspond to the functions $I(v,x)-L(v,x)$ , we obtain the desired min-max representation.

The $L^{\infty}$ bounds for the coefficients follow from the construction of $A_{\eta_{k}}$ , etc… in (5.8), (5.9), (5.10). The continuity of the coefficients and the Lévy measures follows from Lemma 5.16.

∎

Proof of Theorem 1.14.

For the versions of Theorems 1.9 and 1.10 with $\beta<2$ we apply the last part of Lemma 3.9 to conclude the functionals (or translation invariant operators) appearing in the min-max all have the corresponding simpler form. As for Theorem 1.11, we use instead the last part of Corollary 5.18 to obtain the simpler expresion for the Lévy operators in the cases where $\beta<2$ .

∎

6. Some Examples

In this section we list some examples to which our results apply, yet the integro-differential structure given in either (1.2) or (1.3) is not readily apparent from the definition of the operator itself. We emphasize that most cases of the linear examples that we list were already contained in the classic work of Courrège [19], but we include them here for the sake of illustration. In all of these examples, the operators satisfy the GCP and the other technical requirements to apply the results presented above. We do not intend to give all details, but rather just make a list, with some appropriate references. At the end of the section, we list how these examples relate to Assumptions 1.1–1.4.

6.1. The statement of the examples.

Example 6.1.

The generator of a Markov process. Assume that $X_{t}$ is a Markov process taking values in $\mathbb{R}^{d}$ , and that $\mathbb{E}_{x}$ is the expectation of the process, having started from $x$ at $t=0$ . The generator is defined as the operator

[TABLE]

over all $u$ for which the limit exists. (See Liggett [42, Chapter 3].)

Thanks to the fact that $\mathbb{E}$ preserves ordering, one can immediately see that $L$ enjoys the GCP. When $X_{t}$ is such that $L:C^{2}_{b}\to C^{2}_{b}$ , this example is covered by Courrège [19]; but if $X_{t}$ is such that $L:C^{\beta}_{b}\to C^{0}_{b}$ (in a Lipschitz fashion) for some $0<\beta<2$ , then by Theorem 1.14, there are fewer terms (see the list just above Theorem 1.14 for our use of the notation $C^{\beta}_{b}(\mathbb{R}^{d})$ ). In this context, the result of Courrège can be seen as a version of the Lévy-Khintchine formula for a process whose increments need not be stationary.

Example 6.2.

The Dirichlet-to-Neumann map for linear, elliptic operators on half-space. Assume that $L$ is an operator that admits unique bounded solutions on $\mathbb{R}^{d+1}_{+}$ and that has a comparison principle. What we mean by this is the following: we can take $u\in C^{1,\alpha}_{b}(\mathbb{R}^{d})$ and associate to it the unique bounded solution, $U_{u}$ of

[TABLE]

A couple of reasonable examples would be

[TABLE]

where $A$ is uniformly elliptic and Hölder continuous. The Dirichlet-to-Neumann map is then defined as

[TABLE]

First of all, the assumptions on $A$ are such that for some $\alpha^{\prime}$ , $U_{u}\in C^{1,\alpha^{\prime}}_{b}\left(\overline{\mathbb{R}^{d+1}_{+}}\right)$ and hence the normal derivative is well defined (see, e.g. [23, Chapters 8, 9]). It is not hard to check that this operator satisfies the GCP, and this fact comes entirely from the property that the solution operator, by the assumed comparison principle, preserves ordering of solutions whenever the boundary data are ordered (it has nothing to do with linearity of the solution operator). This is, again, within the context of Courrège’s result, but we can invoke Theorem 1.14 to remove extra terms of order higher than $1$ . Ellipticity and scaling show that this is always an operator of order $1$ (and will map $C^{1,\alpha}\to C^{\alpha^{\prime}}$ ). We note that in this example, via linear equations with nice coefficients, one can derive lots of information about the operator $\partial_{n}U_{u}$ by directly using the Poisson kernel that represents the solution $U_{u}$ .

In the context of periodic equations, one can use the results in Sections 4 and 5 to show that the coefficients in the resulting Lévy operators will share the same periodicity. In fact, this is very straightforward if $I$ is linear. If instead one looks at almost periodic coefficients, it seems reasonable to hope that the coefficients will also be almost periodic, but we have not checked this claim. If it is the case, there could be an application to some boundary homogenization problems with irrationally oriented half-spaces inside a periodic medium, related to [31]. Operators related to the Dirichlet-to-Neumann mapping of this example are also of interest in conformal geometry, see Chang-Gonzalez [13]. It is also possible to consider an elliptic equation with weights in order to obtain some operators of order different than 1, e.g. Caffarelli-Silvestre [8].

Example 6.3.

The boundary process of a reflected diffusion. (See Hsu [32], or [33, Chp. IV, Sec. 7] and/or [45, Sec. 8].)

In this context, one starts with a diffusion in $\mathbb{R}^{d+1}_{+}$ , say $X_{t}$ , so that $X_{t}$ reflects off of the bottom boundary whenever it reaches it. Under a time rescaling of $X_{t}$ (because it spends zero time on the boundary), the resulting process can be viewed at times only when it hits $\mathbb{R}^{d}\times\{0\}$ , and induces a pure jump process on $\mathbb{R}^{d}\times\{0\}$ . This process is generated by an operator of the form (1.2) with $A\equiv 0$ . It turns out that this generator for the boundary process is exactly the Dirichlet-to-Neumann mapping from the previous example. This process was studied in a smooth domains for Brownian motion by Hsu [32].

Example 6.4.

Subordinated diffusions and Bernstein functions. (See Schilling-Song-Vondraček [46].)

The time-rescaling of the reflected diffusion in the previous example is just one choice of a rescaling, and in general one can time-rescale diffusions on $\mathbb{R}^{d}$ (so no boundary space here) in a myriad of fashions to create new stochastic processes from one reference Brownian motion. This is a process known as subordination, and it can be used to create operators with generators in the class (1.2), starting with one that may simply only contain the second order term. The generator for the subordinated process will enjoy the GCP because the generator of the original diffusion also enjoys the GCP. This technique has played a large and fundamental role in the study of Lévy processes, and one can see it in use in e.g., the book of Schilling-Song-Vondraček [46], especially [46, Chapter 13]. The subordination formula is closely related to an extension into plus one space variables, and this extension was used to create operators of fractional order that enjoy the GCP in the work of Stinga-Torrea [55] and also provide other properties of the fractional operators.

Example 6.5.

The Monge-Ampère operator, $\textnormal{MA}(u,x)=\textnormal{det}(D^{2}u)$ .

When one restricts this operator to the subset of $C^{2}$ of convex functions, then MA is in fact (degenerate) elliptic and locally Lipschitz. Specifically for each $\delta>0$ , MA is uniformly elliptic (depending upon $\delta$ ), Lipschitz, and translation invariant as a mapping,

[TABLE]

Thus, MA, must enjoy a min-max structure. Experts have known and utilized this min-max propert of MA in the study of fully nonlinear elliptic equations for a long time, and one can show that

[TABLE]

In fact, this formula is intimately connected with various investigations into nonlocal operators that should be an analog of MA in the fractional setting (as of yet, there is not one that is considered better than others). Some works that address nonlocal analogs of MA are: [7], [11], and [28].

Example 6.6.

General nonlocal operators as treated in Caffarelli-Silvestre [9] [10]. These are simply operators that are assumed to satisfy the GCP, are defined for all functions in $C^{1,1}(\mathbb{R}^{d})$ , map $C^{2}_{b}(\mathbb{R}^{d})\to C^{0}_{b}(\mathbb{R}^{d})$ , and satisfy a form of uniform ellipticity that is given by the existence of concave respectively convex operators, $\mathcal{M}^{-}_{\mathcal{L}}$ and $\mathcal{M}^{+}_{\mathcal{L}}$ so that

[TABLE]

Here, $\mathcal{L}$ is a class of linear operators that is usually a particular subset of those that satisfy the Lévy type condition (1.2).

This context for nonlocal operators was given in [9, Definition 3.1], and it played an important role in many of the results– especially when $\mathcal{L}$ is chosen to contain certain classes of operators. These operators, in cases in which they are Lipschitz fall into the scope of our results, and furthermore, the role of the extremal operators gives extra information about the min-max formula. In particular, as shown in [29, Section 4.6], when ellipticity occurs with respect to $\mathcal{M}^{\pm}_{\mathcal{L}}$ , then the min-max may be restricted to only utilize linear functionals (or linear operators) that also satisfy the extremal inequality in (6.1). This also appeared in a homogenization result by one of the authors in which they were unable to show that the limit operator had an explicit integro-differential formula, but rather was only integro-differential and uniformly elliptic in the sense of [9, Definition 3.1] ( see the homogenization in [47]).

Example 6.7.

The Dirichlet to Neumann map for fully nonlinear elliptic equations. In Example 6.2, the linearity of $L$ is not necessary, and the function $U_{u}$ can also be taken to solve a fully nonlinear, uniformly elliptic equation in $\mathbb{R}^{d+1}_{+}$ . These equations always possess a comparison principle (by definition), and under most reasonable assumptions, the solution $U_{u}$ will be globally $C^{1,\alpha^{\prime}}$ , allowing for the normal derivative to be defined classically (see [52] for this regularity).

This was a main topic in the recent paper by the authors and Kitagawa [26]. It turns out that the extremal operators (as in Example 6.6) for the nonlinear D-to-N not only play a crucial role in investigating the Lévy measures in the min-max, but they also take a refreshingly simple form. The extremal operators in this case, $\mathcal{M}^{\pm}_{\mathcal{L}}$ of Example 6.6, are simply the Dirichlet-to-Neumann operators for the solutions of the corresponding extremal operators for the elliptic second order equation in $\mathbb{R}^{d+1}_{+}$ . These are usually called the Pucci extremal operators (see [12]), and solutions to their equations are generally very well behaved. In [26], the properties of the Lévy measures in the min-max are linked to the harmonic measures for linear equations with bounded measurable coefficients (e.g. [39]), but there is still more to learn about them before they can be connected with existing integro-differential theory.

Example 6.8.

An operator that drives surface evolution in one and two phase free boundary problems related to a type of Hele-Shaw flow. Given $f\in C^{1,\alpha}(\mathbb{R}^{d})$ , such that $0<\inf f\leq\sup f<\infty$ , we can define the unique solution, $U_{f}$ , of the elliptic equation,

[TABLE]

This allows to define a (fully nonlinear) operator on $f$ as

[TABLE]

that is, the normal derivative of the solution on the upper boundary given by the graph of $f$ .

For Hele-Shaw flow in the simplified setting that the free boundary is parametrized by the graph of $f(\cdot,t)$ , it can be shown that the free boundary evolves by a normal velocity that at each time is given by $I(f,x)$ . The interpretation here is that fluid flows into the domain under a pressure at the bottom boundary, $x_{d+1}=0$ , and the top edge of the fluid exists at $x_{d+1}=f(x)$ , with $U_{f}$ representing the pressure of the fluid. This pressure induces a force on the fluid, which is given by $\partial_{n}U_{f}(x,f(x))$ at the top boundary. This operator, and its implications for rewriting a class of free boundary problems that are similar to Hele-Shaw was studied by the authors and Chang Lara in [16]. In particular, the min-max formula makes it straightforward to convert the free boundary flow into a nonlocal parabolic equation for $f$ , and this parabolic equation is very similar to ones that have already been studied in the nonlocal literature (e.g. [51]). When $U_{f}$ is defined to be harmonic in the domain determined by $f$ , standard regularity theory immediately gives estimates that show there is some $\alpha^{\prime}$ so that the mapping from $f$ to $I(f)$ is Lipschitz from $C^{1,\alpha}(\mathbb{R}^{d})$ to $C^{\alpha^{\prime}}(\mathbb{R}^{d})$ . In [16] it was also shown that the same Lipschitz property can be obtained when $U_{f}$ is defined as the solution of a nonlinear uniformly elliptic second order equation instead of just the Laplacian. This operator gives a good example of what can be said in the translation invariant case of the min-max, and its properties are studied initially in [16]. Even in the simplest case of defining $U_{f}$ to be harmonic, the resulting operator $I$ will always be inherently nonlinear and nonlocal.

6.2. Relationship to Assumptions 1.1–1.4

Here we list how each of the above examples fits within the context of Assumptions 1.1–1.4.

(Example 6.1). By construction, this $L$ is always linear. Thus, Assumption 1.1 follows from simply saying that $L$ is a bounded operator on $C^{\beta}$ , which of course requires assumptions on the process, $X_{t}$ , or more specifically the transition probability measure for $X_{t}$ . Again, via linearity, Assumption 1.2 follows whenever the process, $X_{t}$ , has stationary and independent increments. Assumptions 1.3 and 1.4 will be an extra requirement on the transition probability measure for $X_{t}$ . In particular (although a bit circular), Assumption 1.4, in view of linearity, is equivalent to the martingale problem for $X_{t}$ having a solution and the generator having uniformly continuous coefficients.

(Example 6.2). (The interested reader can see [26] for more details.) Assumption 1.1 holds for $C^{1,\alpha}\to C^{\alpha^{\prime}}$ when $A$ is $\alpha$ -Hölder continuous. Assumption 1.2 holds if $A$ is a constant. Assumption 1.3 holds in both of the above settings, by using a barrier argument (which is easier implemented for the non-divergence equation). Since $I$ is linear, Assumption 1.4 holds when $A$ is Hölder continuous. Indeed, by linearity, checking Assumption 1.4 is equivalent to estimating

[TABLE]

In the case of divergence equations, one can write down the equations satisfied for $V=\tau_{-z}U_{u}$ , and then also the equation satisfied by $W:=U_{\tau_{-z}u}-V$ . The desired estimate is then equivalent to estimating $\left|\partial_{n}W(x+z)\right|$ , i.e. a global Lipschitz estimate for $W$ . Since $W$ satisfies

[TABLE]

we see that by global Lipschitz estimates,

[TABLE]

by the original assumption that $A$ is Hölder continuous. (Note, the Lipschitz estimates here are a standard modification to, e.g. [25, Lemma 3.2] to allow for a right hand side of the form $\textnormal{div}(f)$ with $f\in L^{\infty}$ .)

(Example 6.3). In most reasonable situations in which the diffusion has regular coefficients, this is contained in the previous example.

(Example 6.4). This, of course, depends heavily on the original Markov process and the choice of subordinator. However, one of the most classical situations starts with a Brownian motion and then uses a Lévy stable subordinator. In this case, the resulting operator is translation invariant, and Assumptions 1.1 and 1.2 follow more or less by construction.

(Example 6.5). This is a translation invariant operator, and as mentioned already satisfies the Lipschitz property on the specified convex subsets of $C^{2}$ . So, Assumptions 1.1 and 1.2 hold.

(Example 6.6). As this is a general example, the operators only satisfy the given assumptions when explicitly required to do so. However, the interesting part of this example arises from the fact that the knowledge of the extremal inequalities in (6.1) in fact gives more detailed information about the linear operators that will appear in the min-max of Theorems 1.9–1.14. This is discussed in [29, Section 4.6].

(Example 6.7). This operator satisfies Assumption 1.1 as a mapping of $C^{1,\alpha}\to C^{\alpha^{\prime}}$ (for some $0<\alpha^{\prime}<\alpha$ ) under standard assumptions about $F$ . The relevant regularity theory comes from Silvestre-Sirakov [52]. It can also be checked by using the same type of barrier argument that works for Example 6.2 will show Assumption 1.3 is also satisfied. Due to the nonlinear nature of the D-to-N in this setting, it is not obvious how to show that Assumption 1.4 is satisfied– we do not know if it satisfied or not. Thus, the best one can say about this operator when it is not translation invariant is the outcome of Theorem 1.9. We simply note to the interested reader that because of the lack of exact cancelation from the fact that the mapping is not linear, one probably needs more detailed information about $F$ . Indeed, using the extremal operators would not help because it would produce

[TABLE]

Here we use $M^{\pm}$ as the extremal operators for $I$ , and also that these are translation invariant. This estimate completely neglects the influence of the shift, $\tau_{z}$ , and so it would not be useful (furthermore, one expects that $M^{+}(u,x)>M^{-}(u,x)$ ).

(Example 6.8). As it is stated above, this operator, $I$ , is actually translation invariant, and so it is straightforward to check that Assumptions 1.1 and 1.2 hold. In the case that the equation for $U$ (i.e. $\Delta U=0$ ) is replaced by either a fully nonlinear operator and/or and operator that is not translation invariant, it is harder to check all of the applicable assumptions. Again, for fully nonlinear equations that define $U$ , in [16] $I$ was checked to be Lipschitz as a map of $C^{1,\alpha}\to C^{\alpha^{\prime}}$ (which took a reasonably non-trivial amount of work).

Appendix A Additional proofs and computations

Proof of Proposition 4.10.

Fix $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ , and let $x\in G_{n}$ , then by the regularity of $u$ ,

[TABLE]

Therefore,

[TABLE]

For the second estimate, we shall make use of

[TABLE]

Therefore,

[TABLE]

It follows that

[TABLE]

and the proposition is proved. ∎

Proof of Proposition 4.11.

Fix $u\in C^{\beta}_{b}(\mathbb{R}^{d})$ .

Step 1. Let $x\in G_{n}$ , then

[TABLE]

Proof of Step 1. By the regularity of $u$ ,

[TABLE]

Therefore,

[TABLE]

Step 2. Given $x\in G_{n}$ , we have

[TABLE]

Step 3.

[TABLE]

∎

Computation for Lemma 4.17.

[TABLE]

If $|x-x_{0}|^{\beta_{0}}\leq h_{n}$ , then

[TABLE]

This expression is zero except when $|x-x_{0}|\leq h_{n}^{1/\beta_{0}}$ , so

[TABLE]

Furthermore, for $x,x^{\prime}$ such that $|x-x_{0}|^{\beta_{0}}\leq h_{n}$ , we have

[TABLE]

In conclusion,

[TABLE]

∎

Bibliography55

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Luis Alvarez, Frédéric Guichard, Pierre-Louis Lions, and Jean-Michel Morel. Axioms and fundamental equations of image processing. Arch. Rational Mech. Anal. , 123(3):199–257, 1993.
2[2] G. Barles, E. Chasseigne, and C. Imbert. On the Dirichlet problem for second-order elliptic integro-differential equations. Indiana Univ. Math. J. , 57(1):213–246, 2008.
3[3] G. Barles, E. Chasseigne, and C. Imbert. Hölder continuity of solutions of second-order elliptic integro-differential equations. J. Eur. Math. Soc. , 13(1):1–26, 2011.
4[4] Guy Barles, Emmanuel Chasseigne, Adina Ciomaga, and Cyril Imbert. Lipschitz regularity of solutions for mixed integro-differential equations. J. Differential Equations , 252(11):6012–6060, 2012.
5[5] Guy Barles and Cyril Imbert. Second-order elliptic integro-differential equations: viscosity solutions’ theory revisited. Ann. Inst. H. Poincaré Anal. Non Linéaire , 25(3):567–585, 2008.
6[6] Guy Barles and Panagiotis E. Souganidis. A new approach to front propagation problems: theory and applications. Arch. Rational Mech. Anal. , 141(3):237–296, 1998.
7[7] Luis Caffarelli and Fernando Charro. On a fractional Monge-Ampère operator. Ann. PDE , 1(1):Art. 4, 47, 2015.
8[8] Luis Caffarelli and Luis Silvestre. An extension problem related to the fractional Laplacian. Comm. Partial Differential Equations , 32(7-9):1245–1260, 2007.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Min-max formulas for nonlocal elliptic operators on Euclidean Space

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Theorem 1**.**

Theorem 2**.**

Theorem 3**.**

1.1. Assumptions and main results

Assumption 1.1**.**

Assumption 1.2**.**

Assumption 1.3**.**

Assumption 1.4**.**

Remark 1.5**.**

Remark 1.6**.**

Remark 1.7**.**

Definition 1.8**.**

Theorem 1.9**.**

Theorem 1.10**.**

Theorem 1.11**.**

Definition 1.12**.**

Assumption 1.13**.**

Theorem 1.14**.**

Remark 1.15**.**

1.2. Notation

1.3. Background

1.4. Another description of operators satisfying the GCP

Definition 1.16**.**

2. Real valued Lipschitz functions on Banach Spaces

Definition 2.1**.**

Proposition 2.2**.**

Proof.

Definition 2.3**.**

Proposition 2.4**.**

Proof.

Theorem 2.5** (Lebourg’s Theorem).**

Proof.

Theorem 2.6**.**

Proof.

3. Functionals with the GCP, revisited

Definition 3.1**.**

Proposition 3.2**.**

Remark 3.3**.**

Proof.

Lemma 3.4**.**

Proof.

Definition 3.5**.**

Lemma 3.6**.**

Remark 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

Lemma 3.9**.**

Proof.

Corollary 3.10**.**

3.1. Proofs of Theorems 1.9 and 1.10

Proof of Theorem 1.10.

Proof of Theorem 1.9.

Remark 3.11**.**

4. Finite Dimensional Approximations to Cbβ(Rd)C^{\beta}_{b}(\mathbb{R}^{d})Cbβ​(Rd)

4.1. Graph approximations

Definition 4.1**.**

Remark 4.2**.**

4.2. Cube decomposition and partition of unity

Lemma 4.3**.**

Proof.

Remark 4.4**.**

Remark 4.5**.**

Proposition 4.6**.**

Proof.

4.3. Discrete derivatives

Definition 4.7**.**

Definition 4.8**.**

Remark 4.9**.**

Theorem 1.

Theorem 2.

Theorem 3.

Assumption 1.1.

Assumption 1.2.

Assumption 1.3.

Assumption 1.4.

Remark 1.5.

Remark 1.6.

Remark 1.7.

Definition 1.8.

Theorem 1.9.

Theorem 1.10.

Theorem 1.11.

Definition 1.12.

Assumption 1.13.

Theorem 1.14.

Remark 1.15.

Definition 1.16.

Definition 2.1.

Proposition 2.2.

Definition 2.3.

Proposition 2.4.

Theorem 2.5 (Lebourg’s Theorem).

Theorem 2.6.

Definition 3.1.

Proposition 3.2.

Remark 3.3.

Lemma 3.4.

Definition 3.5.

Lemma 3.6.

Remark 3.7.

Lemma 3.8.

Lemma 3.9.

Corollary 3.10.

Remark 3.11.

4. Finite Dimensional Approximations to $C^{\beta}_{b}(\mathbb{R}^{d})$

Definition 4.1.

Remark 4.2.

Lemma 4.3.

Remark 4.4.

Remark 4.5.

Proposition 4.6.

Definition 4.7.

Definition 4.8.

Remark 4.9.

Proposition 4.10.

Proposition 4.11.

Definition 4.12.

Theorem 4.13.

Proposition 4.14.

Remark 4.15.

Proposition 4.16.

Lemma 4.17.

Remark 4.18.

Remark 4.19.

Proposition 4.20.

Lemma 4.21.

Remark 4.22.

5. Analysis of $I(u,x)$ via the finite dimensional approximations

5.1. The operators $I_{n}$ and their min-max representation

Definition 5.1.

Lemma 5.2.

Lemma 5.3.

Remark 5.4.

Remark 5.5.

Proposition 5.6.

Proposition 5.7.

Proposition 5.8.

Proposition 5.9.

5.2. Properties of $\mathcal{D}I_{n}$

Proposition 5.10.

Proposition 5.11.

Proposition 5.12.

Proposition 5.13.

Proposition 5.14.

5.3. Properties of $\mathcal{D}_{I}$

Remark 5.15.

Lemma 5.16.

Lemma 5.17.