Double phase image restoration

Petteri Harjulehto; Peter Hästö

arXiv:1906.09837·math.AP·May 27, 2021

TL;DR

This paper investigates the use of double phase functionals for image restoration, focusing on mathematical properties and convergence of energy minimizers in the context of bounded variation functions.

Contribution

It introduces a novel analysis of double phase energy minimizers for $BV$ functions and establishes their connection via $\Gamma$-convergence and relaxation techniques.

Findings

1. Double phase energy minimizers are characterized for $BV$ functions.

2. The energy can be obtained through $\Gamma$-convergence of regularized functionals.

3. A capped fractional maximal function is used as a key analytical tool.

Abstract

In this paper we explore the potential of the double phase functional in an image processing context. To this end, we study minimizers of the double phase energy for functions with bounded variation and show that this energy can be obtained by $\Gamma$-convergence or relaxation of regularized functionals. A central tool is a capped fractional maximal function of the derivative of $BV$ functions.

Equations

$$\inf_{u}\int_{\Omega}|\nabla u|+|u-f|^{2}\,dx,$$

$$\int_{\Omega}|\nabla u|^{p}+a(x)|\nabla u|^{q}\,dx.$$

$$|Du|(\Omega)+\int_{\Omega}(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx$$

$$\int_{\Omega}|\nabla u|^{1+\varepsilon}+(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx\quad\text{and}\quad\int_{\Omega}|\nabla u|+(\varepsilon+a(x)^{2})|\nabla u|^{2}+|u-f|^{2}\,dx.$$

$$\|u\|_{L^{p}_{a}(\Omega)}:=\|a\,u\|_{L^{p}(\Omega)}=\Big(\int_{\Omega}(a(x)|u|)^{p}\,dx\Big)^{\frac{1}{p}}.$$

$$|\mu|(A)=\sup\Big\{\sum_{i\in\mathbb{N}}|\mu(A_{i})|\,\Big|\,\bigcup_{i\in\mathbb{N}}A_{i}=A,\ A_{i}\text{ disjoint and measurable}\Big\}.$$

$$|Du|(\Omega):=\sup\bigg\{\int_{\Omega}u\operatorname{div}\varphi\,dx\,\Big|\,\varphi\in C^{1}_{0}(\Omega;\mathbb{R}^{n}),\ |\varphi|\leqslant 1\bigg\}<\infty.$$

$$Du=\nabla u\,\mathcal{H}^{n}+(u_{+}-u_{-})\,\nu_{u}\,\mathcal{H}^{n-1}|_{J_{u}}+C_{u},$$

$$u_{i}\to u\text{ in }L^{1}(\Omega)\quad\text{and}\quad|Du|(\Omega)\leqslant\liminf|Du_{i}|(\Omega).$$

$$\nabla(u*\eta_{\delta})(x)=\int_{\mathbb{R}^{n}}\eta_{\delta}(x-y)\,dDu(y)=\int_{\mathbb{R}^{n}}u(y)\nabla\eta_{\delta}(x-y)\,dy.$$

$$\operatorname{\mathcal{I}}(u,A):=|Du|(A)+\int_{A}(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx$$

$$\operatorname{\mathcal{I}}(u,\Omega)=\inf_{v\in BV^{1,2}_{a}(\Omega)}\operatorname{\mathcal{I}}(v,\Omega).$$

$$\lim_{i\to\infty}\operatorname{\mathcal{I}}(u_{i},\Omega)=\inf_{v\in BV^{1,2}_{a}(\Omega)}\operatorname{\mathcal{I}}(v,\Omega).$$

$$\int_{\Omega}(a(x)|\nabla u|)^{2}\,dx\leqslant\liminf\int_{\Omega}(a(x)|\nabla u_{i}|)^{2}\,dx.$$

$$\operatorname{\mathcal{I}}_{\varepsilon}(u,A):=\int_{A}|\nabla u|^{1+\varepsilon}+(\varepsilon+a(x)^{2})|\nabla u|^{2}+|u-f|^{2}\,dx.$$

$$\limsup_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},F)\leqslant\operatorname{\mathcal{I}}(u,F).$$

$$\limsup_{\delta\to 0}\Big[|Du_{\delta}|(F)+\int_{F}|u_{\delta}-f|^{2}\,dx\Big]\leqslant|Du|(F)+\int_{F}|u-f|^{2}\,dx.$$

$$a(x)|\nabla u_{\delta}|\leqslant 2\int_{\mathbb{R}^{n}}a(y)|\nabla u(y)|\,\eta_{\delta}(x-y)\,dy\lesssim M(a|\nabla u|)(x);$$

$$a(x)-c|x-y|\leqslant a(y)\leqslant\tfrac{1}{2}a(x),$$

$$a(x)|\nabla u_{\delta}|\lesssim\int_{\mathbb{R}^{n}}\delta\,|u(y)\nabla\eta_{\delta}(x-y)|\,dy\lesssim\frac{1}{|B(x,\delta)|}\int_{B(x,\delta)}|u(y)|\,dy\leqslant Mu(x),$$

$$\lim_{\delta\to 0}\int_{F}(a(x)|\nabla u_{\delta}|)^{2}\,dx=\int_{F}(a(x)|\nabla u|)^{2}\,dx.$$

$$\limsup_{\delta\to 0}\operatorname{\mathcal{I}}(u_{\delta},F)\leqslant\operatorname{\mathcal{I}}(u,F).$$

$$\int_{F}|\nabla u_{\delta}|^{1+\varepsilon}+(\varepsilon+a(x)^{2})|\nabla u_{\delta}|^{2}\,dx\leqslant\Big(\frac{c}{\delta^{n}}\Big)^{\varepsilon}|Du_{\delta}|(F)+\int_{F}(a(x)|\nabla u_{\delta}|)^{2}\,dx+\varepsilon\Big(\frac{c}{\delta^{n}}\Big)^{2}|F|.$$

$$
\begin{aligned}
&\limsup_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},F)\\
&\qquad\leqslant\limsup_{i\to\infty}\Big[\big(\tfrac{c}{\delta_{i}^{n}}\big)^{\varepsilon_{i}}|Du_{i}|(F)+\int_{F}(a(x)|\nabla u_{i}|)^{2}+|u_{i}-f|^{2}\,dx+\varepsilon_{i}\big(\tfrac{c}{\delta_{i}^{n}}\big)^{2}|F|\Big]\\
&\qquad=\limsup_{i\to\infty}\Big[|Du_{i}|(F)+\int_{F}(a(x)|\nabla u_{i}|)^{2}+|u_{i}-f|^{2}\,dx\Big]\leqslant\operatorname{\mathcal{I}}(u,F).
\end{aligned}
$$

$$a(x)\lesssim\max\{|x-y|,a(y)\}\quad\text{for all }x,y\in\Omega.$$

$$M_{\alpha}^{\sigma}\mu(x):=\sup_{r\leqslant\operatorname{diam}\Omega}\frac{\min\{|\mu|(B(x,r)),r^{\sigma}\}}{|B(x,r)|^{1-\frac{\alpha}{n}}}$$

$$M_{\alpha}^{\sigma}\mu(x)\lesssim\sup_{k\in K_{0}}\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}},$$

$$M_{\alpha}^{\sigma}\mu(x)^{p}\lesssim\sup_{k\in K_{0}}\Big(\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}}\Big)^{p}\leqslant\sum_{k\in K_{0}}\Big(\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}}\Big)^{p}.$$

$$\int_{\Omega}M_{\alpha}^{\sigma}\mu(x)^{p}\,dx$$

Full text

Petteri Harjulehto, Department of Mathematics and Statistics, FI-20014 University of Turku, Finland

and

Peter Hästö, Department of Mathematics and Statistics, FI-20014 University of Turku, Finland, and Department of Mathematics, FI-90014 University of Oulu, Finland

Key words and phrases:

Image restoration, double phase, bounded variation, Gamma-convergence, relaxation, fractional maximal function

2010 Mathematics Subject Classification:

49J45; 49N45, 94A08

1. Introduction

The double phase functional was introduced in the 1980s by Zhikov [46], but has only recently become the focus of intense research, starting in 2015 with Baroni, Colombo and Mingione [5, 6, 15, 17]. Subsequently, many other researchers studied double phase problems as well, see, e.g., [9, 18, 20, 22, 39, 40] for regularity theory, [10, 21, 43] for Calderón–Zygmund estimates and [27, 36, 37] for some other topics. Generalizations of the double phase functional have been studied, e.g., in [7, 25, 26, 28, 33, 34, 38, 45].

Zhikov’s original motivation for his functionals with non-standard growth was modelling physical phenomena. Another of his models, the variable exponent functional, was later applied also to the context of image processing, see [1, 14, 31, 35]. In this article, we demonstrate the potential also of the double phase functional in the image processing domain. This is to the best of our knowledge the first paper to consider the double phase functional in the space $BV$ of functions of bounded variation.

In mathematical image processing, we interpret a function $u:\Omega\to\mathbb{R}$ as the gray-scale intensity at each location. If the function is discretized, we obtain an array of pixels common in computer implementations. Typically, $\Omega$ is a rectangle and the image contains different objects whose edges correspond to discontinuities of $u$ . The presence of discontinuities makes this field challenging to approach with tools of analysis, but the $BV$ space has proven useful. We refer to the book [4] by Aubert and Kornprobst for an overview of PDE-based image processing.

The classical ROF-model [41] for image restoration calls for minimizing the energy

$$\inf_{u}\int_{\Omega}|\nabla u|+|u-f|^{2}\,dx,$$

where $f$ is the given, corrupted input image that is to be restored. Here $|u-f|^{2}$ is a fidelity term which forces $u$ to be close to $f$ on average, whereas the regularizing term $|\nabla u|$ limits the variation of $u$ . This model is known to be prone to a stair-casing or banding effect whereby piecewise constant minimizers are often produced [13]. On the other hand, replacing $|\nabla u|$ by $|\nabla u|^{2}$ leads to a heat-equation type problem, and solutions which are $C^{\infty}$ . This is not usually desirable in the image processing context, as edges become blurred.

The energy of the double phase functional combines growth with two different powers. It is given by the expression

$$\int_{\Omega}|\nabla u|^{p}+a(x)|\nabla u|^{q}\,dx.$$

Here $a\geqslant 0$ is a bounded function and $p<q$ . All the previously mentioned double-phase references concern super-linear growth (usually $p>1$ , but see also [22]). However, for image processing, the case $p=1$ and $q=2$ is especially interesting (see above and the discussion in [14]). Then the first term corresponds to the ROF-model, whereas the second term introduces a smoothing effect when $a>0$ . The parameter $a$ is chosen such that $a=0$ at the edges in the image and $a>0$ elsewhere. Usually, the location of the edges is not known, so in applications $a$ is estimated from the initial data $f$ . Then this adaptive model can avoid the stair-casing effect of the ROF-model.
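To make the adaptive model concrete, here is a minimal one-dimensional sketch of a discretized energy with $p=1$, $q=2$ and a fidelity term, that is, total variation plus $(a|\nabla u|)^{2}$ plus $|u-f|^{2}$, using forward differences. The function names, the threshold `tau`, and the specific rule for estimating $a$ from $f$ are illustrative assumptions and are not taken from the paper.

```python
def forward_diff(u, h):
    """Forward difference quotient of a sampled signal with grid spacing h."""
    return [(u[i + 1] - u[i]) / h for i in range(len(u) - 1)]


def edge_weight(f, h, tau):
    """Hypothetical edge indicator: a = 0 where the slope of f is at least tau,
    and a approaches 1 where f is flat."""
    return [max(0.0, 1.0 - abs(g) / tau) for g in forward_diff(f, h)]


def double_phase_energy(u, f, a, h):
    """Discrete analogue of |Du| + (a |grad u|)^2 + |u - f|^2 summed over the grid."""
    grad = forward_diff(u, h)
    total_variation = h * sum(abs(g) for g in grad)
    smoothing = h * sum((a_i * g) ** 2 for a_i, g in zip(a, grad))
    fidelity = h * sum((u_i - f_i) ** 2 for u_i, f_i in zip(u, f))
    return total_variation + smoothing + fidelity


# A single sharp edge: the adaptive weight vanishes at the jump,
# so the quadratic term does not penalize it.
f = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
a_adaptive = edge_weight(f, h=1.0, tau=0.5)
a_uniform = [1.0] * (len(f) - 1)
energy_adaptive = double_phase_energy(f, f, a_adaptive, h=1.0)
energy_uniform = double_phase_energy(f, f, a_uniform, h=1.0)
```

In this toy example the adaptive weight leaves only the total variation of the jump, while a uniform weight adds a quadratic penalty at the edge, which is the blurring pressure that the choice $a=0$ at edges is meant to avoid.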

In the case $p=1$ , the double phase energy must naturally be studied in a space of $BV$ -type. It is not difficult to prove existence of the minimizer even in this case (cf. Proposition 2.4). However, the $BV$ -space is quite ill-behaved, so it is useful for practical implementations to approximate the energy by more regular functionals (see, e.g., [44, Section 6] in the image processing context). The notion of $\Gamma$ -convergence is often employed in this situation [8, 19], and this article is no exception: our main result (Theorems 4.1 and 4.2) shows that the $BV$ double phase functional (with fidelity term)

$$|Du|(\Omega)+\int_{\Omega}(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx$$

can be approximated in the sense of $\Gamma$ -convergence by both

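$$\int_{\Omega}|\nabla u|^{1+\varepsilon}+(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx\quad\text{and}\quad\int_{\Omega}|\nabla u|+(\varepsilon+a(x)^{2})|\nabla u|^{2}+|u-f|^{2}\,dx.$$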

Finally, in Corollary 4.3, we show that the $BV$ double phase functional can be understood as the relaxation of the $W^{1,1}$ double phase functional.

Note that we use $a$ inside the power-function, $(a(x)t)^{2}$ . This is of course equivalent to having another function outside, but it turns out that the condition on $a$ can be more conveniently expressed with this formulation (see Remark 3.3).

2. Notation and existence of minimizers of bounded variation

We consider subsets of the Euclidean space ${\mathbb{R}^{n}}$ , $n\geqslant 2$ . The most interesting case for image processing is $n=2$ , but we can include higher dimensions without extra complication. By $\Omega\subset{\mathbb{R}^{n}}$ we denote a bounded domain, i.e. an open and connected set. The notation $f\lesssim g$ means that there exists a constant $C>0$ such that $f\leqslant Cg$ . By $c$ we denote a generic constant whose value may change between appearances. Let $a\in L^{\infty}(\Omega)$ be non-negative. By $L^{p}_{a}(\Omega)$ we denote the weighted Lebesgue space with weight $a$ , given by the norm

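$$\|u\|_{L^{p}_{a}(\Omega)}:=\|a\,u\|_{L^{p}(\Omega)}=\Big(\int_{\Omega}(a(x)|u|)^{p}\,dx\Big)^{\frac{1}{p}}.$$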

$W^{1,p}_{a}(\Omega)$ is the corresponding Sobolev space. Note that we use the “weight as multiplier” formulation, so the corresponding weighted measure is $d\mu=a^{p}\,dx$ , not $d\mu=a\,dx$ . By ${\mathcal{H}}^{k}$ we denote the $k$ -dimensional Hausdorff measure. By $|\mu|$ we denote the total variation measure of a vector measure $\mu$ , defined as

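$$|\mu|(A)=\sup\Big\{\sum_{i\in\mathbb{N}}|\mu(A_{i})|\,\Big|\,\bigcup_{i\in\mathbb{N}}A_{i}=A,\ A_{i}\text{ disjoint and measurable}\Big\}.$$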

By $Mu$ we denote the Hardy–Littlewood maximal function of $u$ .

A function $u\in L^{1}(\Omega)$ has bounded variation, denoted $u\in BV(\Omega)$ , if

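$$|Du|(\Omega):=\sup\bigg\{\int_{\Omega}u\operatorname{div}\varphi\,dx\,\Big|\,\varphi\in C^{1}_{0}(\Omega;\mathbb{R}^{n}),\ |\varphi|\leqslant 1\bigg\}<\infty.$$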

Note that this quantity is sometimes denoted by $\|Du\|(\Omega)$ . We follow the notation of [3], which is convenient since it turns out that $|Du|$ is the total variation of a vector measure $Du$ . Furthermore, $Du$ can be decomposed as

$$Du=\nabla u\,\mathcal{H}^{n}+(u_{+}-u_{-})\,\nu_{u}\,\mathcal{H}^{n-1}|_{J_{u}}+C_{u},\tag{2.1}$$

where $\nabla u$ is the absolutely continuous part of the derivative, $u_{+}-u_{-}$ is the essential point-wise jump of the function, $\nu_{u}$ is the normal of the level-set, $J_{u}$ is a set of Hausdorff dimension at most $n-1$ [2, Theorem 2.3] and the Cantor part $C_{u}$ has the property that $C_{u}(A)=0$ if ${\mathcal{H}}^{n-1}(A)<\infty$ [3, Proposition 3.92]. The space $BV$ has the following precompactness property [3, Proposition 3.13]: if $\sup_{i}\big{(}|Du_{i}|(\Omega)+\|u_{i}\|_{L^{1}(\Omega)}\big{)}<\infty$ , then there exists a subsequence, denoted again by $(u_{i})$ , and $u\in BV(\Omega)$ such that

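$$u_{i}\to u\text{ in }L^{1}(\Omega)\quad\text{and}\quad|Du|(\Omega)\leqslant\liminf|Du_{i}|(\Omega).\tag{2.2}$$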

The derivative of the convolution of a $BV$ -function can be calculated as expected using either the derivative-measure or the function [3, Proposition 3.2 and equation (2.2)]:

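$$\nabla(u*\eta_{\delta})(x)=\int_{\mathbb{R}^{n}}\eta_{\delta}(x-y)\,dDu(y)=\int_{\mathbb{R}^{n}}u(y)\nabla\eta_{\delta}(x-y)\,dy.\tag{2.3}$$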

We refer to [2, 3, 8] for more information about $BV$ spaces.

We abbreviate $BV^{1,2}_{a}(\Omega):=BV(\Omega)\cap W^{1,2}_{a}(\Omega)\cap L^{2}(\Omega)$ and define for $u\in BV^{1,2}_{a}(\Omega)$ and initial data $f\in L^{2}(\Omega)$ the $BV$ double phase functional

$$\operatorname{\mathcal{I}}(u,A):=|Du|(A)+\int_{A}(a(x)|\nabla u|)^{2}+|u-f|^{2}\,dx$$

for measurable $A\subset\Omega$ . We can easily show the existence of a minimizer for this functional using the direct method of calculus of variations:

Proposition 2.4.

There exists a unique minimizer $u\in BV^{1,2}_{a}(\Omega)$ , i.e.

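$$\operatorname{\mathcal{I}}(u,\Omega)=\inf_{v\in BV^{1,2}_{a}(\Omega)}\operatorname{\mathcal{I}}(v,\Omega).$$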

Proof.

Let $u_{i}$ be a minimizing sequence, that is $u_{i}\in BV^{1,2}_{a}(\Omega)$ with

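$$\lim_{i\to\infty}\operatorname{\mathcal{I}}(u_{i},\Omega)=\inf_{v\in BV^{1,2}_{a}(\Omega)}\operatorname{\mathcal{I}}(v,\Omega).$$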

By $BV$ -precompactness (2.2) there exists a subsequence, denoted again by $(u_{i})$ , such that $u_{i}\to u$ in $L^{1}(\Omega)$ and $|Du|(\Omega)\leqslant\liminf|Du_{i}|(\Omega)$ . The space $W^{1,2}_{a}(\Omega)$ is reflexive [29, Theorem 3.6.8], so we can find a weakly convergent subsequence $(u_{i})$ . By [24, Theorem 2.2.8], the modular in $W^{1,2}_{a}(\Omega)$ is weakly lower semicontinuous, so that

$$\int_{\Omega}(a(x)|\nabla u|)^{2}\,dx\leqslant\liminf\int_{\Omega}(a(x)|\nabla u_{i}|)^{2}\,dx.$$

The inequality for the term $|u-f|^{2}$ follows analogously. Hence $u$ is a minimizer.

Finally, we note that the $BV$ and $W^{1,2}_{a}$ parts are convex and the $|u-f|^{2}$ part is strictly convex, so the usual argument yields uniqueness, namely, if $u$ and $v$ are distinct minimizers, then we obtain a contradiction from $\operatorname{\mathcal{I}}(\frac{u+v}{2},\Omega)<\frac{1}{2}(\operatorname{\mathcal{I}}(u,\Omega)+\operatorname{\mathcal{I}}(v,\Omega))$ . ∎
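The strict inequality in the uniqueness argument can be made explicit. The following worked step is a standard identity for the quadratic fidelity term, added here as an illustration rather than quoted from the paper; it holds pointwise for real numbers $u$, $v$ and $f$.

```latex
\Big|\frac{u+v}{2}-f\Big|^{2}
  =\frac{1}{2}|u-f|^{2}+\frac{1}{2}|v-f|^{2}-\frac{1}{4}|u-v|^{2}.
```

Integrating over $\Omega$, the last term is strictly negative whenever $u\neq v$ on a set of positive measure, and the remaining two parts of the functional are convex, which yields the strict inequality for the midpoint.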

3. Lower estimates for the $BV$ double phase functional

To be able to construct the minimizers of $\operatorname{\mathcal{I}}$ with some numerical scheme, we must show that the $BV$ double phase functional can be approximated by some more regular variants. We regularize the functional by adding $\varepsilon$ either to the exponent of the first term (so that the problem is in $W^{1,1+\varepsilon}(\Omega)$ ) or to the weight $a$ (in which case the problem is in $W^{1,2}(\Omega)$ ). For brevity, we present the proof only for one case which includes both these regularizations:

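$$\operatorname{\mathcal{I}}_{\varepsilon}(u,A):=\int_{A}|\nabla u|^{1+\varepsilon}+(\varepsilon+a(x)^{2})|\nabla u|^{2}+|u-f|^{2}\,dx.$$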
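As a rough numerical illustration, the sketch below compares a one-dimensional discretization of the regularized density $|\nabla u|^{1+\varepsilon}+(\varepsilon+a^{2})|\nabla u|^{2}+|u-f|^{2}$ with the limiting density $|\nabla u|+(a|\nabla u|)^{2}+|u-f|^{2}$ for a fixed grid function. The function names and the sample data are assumptions made for this example. For a fixed discrete $u$ the convergence as $\varepsilon\to 0$ is elementary; it does not reproduce the $\Gamma$-convergence statement in $BV$, which concerns sequences of functions.

```python
def forward_diff(u, h):
    """Forward difference quotient of a sampled signal with grid spacing h."""
    return [(u[i + 1] - u[i]) / h for i in range(len(u) - 1)]


def fidelity(u, f, h):
    """Discrete version of the integral of |u - f|^2."""
    return h * sum((u_i - f_i) ** 2 for u_i, f_i in zip(u, f))


def energy_bv(u, f, a, h):
    """Discrete analogue of the limit functional: |grad u| + (a |grad u|)^2 + |u - f|^2."""
    grad = forward_diff(u, h)
    return h * sum(abs(g) + (a_i * g) ** 2 for a_i, g in zip(a, grad)) + fidelity(u, f, h)


def energy_regularized(u, f, a, h, eps):
    """Discrete analogue of the regularized functional with parameter eps > 0."""
    grad = forward_diff(u, h)
    density = (abs(g) ** (1 + eps) + (eps + a_i ** 2) * g ** 2 for a_i, g in zip(a, grad))
    return h * sum(density) + fidelity(u, f, h)


# Sample data (assumed for illustration): a short ramp followed by a plateau.
u = [0.0, 0.5, 1.0, 1.0]
f = list(u)
a = [1.0, 1.0, 1.0]
limit_value = energy_bv(u, f, a, h=1.0)
gaps = [abs(energy_regularized(u, f, a, 1.0, eps) - limit_value) for eps in (1e-1, 1e-3, 1e-6)]
```

The list `gaps` records how far the regularized value is from the limiting one for each $\varepsilon$; on this data it shrinks as $\varepsilon$ decreases.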

We start with a lower bound for $\operatorname{\mathcal{I}}$ , which is the more difficult part.

Lemma 3.1.

Let $F\subset\Omega$ be closed and $a\in C^{0,1}(\Omega)$ . For $\varepsilon_{i}\to 0^{+}$ and $u\in BV^{1,2}_{a}(\Omega)$ , there exist $u_{i}\in W^{1,2}(U)$ in a neighborhood $U$ of $F$ such that

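$$\limsup_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},F)\leqslant\operatorname{\mathcal{I}}(u,F).$$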

Proof.

Let $u_{\delta}:=u*\eta_{\delta}$ be the convolution with the standard mollifier and assume that $\delta<\operatorname{dist}(F,\partial\Omega)$ . By [30, Lemma 4.5] and classical $L^{2}$ -results

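$$\limsup_{\delta\to 0}\Big[|Du_{\delta}|(F)+\int_{F}|u_{\delta}-f|^{2}\,dx\Big]\leqslant|Du|(F)+\int_{F}|u-f|^{2}\,dx.$$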

For the term with the weight $a$ , we consider two cases and use the different expressions from (2.3). If $0<a(x)\leqslant 2a(y)$ for all $y\in B(x,\delta)$ , then

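$$a(x)|\nabla u_{\delta}|\leqslant 2\int_{\mathbb{R}^{n}}a(y)|\nabla u(y)|\,\eta_{\delta}(x-y)\,dy\lesssim M(a|\nabla u|)(x);$$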

note that the condition $0<a(y)$ with $u\in W^{1,2}_{a}(\Omega)$ ensures that $Du=\nabla u$ is absolutely continuous in $B(x,\delta)$ and note also that the last inequality follows from elementary estimates (e.g. [24, Lemma 4.6.3]). Furthermore, since $a|\nabla u|\in L^{2}(\Omega)$ and the maximal operator is bounded on $L^{2}(\Omega)$ , we see that the function on the right-hand side is in $L^{2}(\Omega)$ , as well. If $a(x)=0$ , then the estimate trivially holds. Suppose then that $a(x)>2a(y)$ for some $y\in B(x,\delta)$ . Since $a\in C^{0,1}(\Omega)$ , we obtain the inequality

$$a(x)-c|x-y|\leqslant a(y)\leqslant\tfrac{1}{2}a(x),$$

so that $a(x)\lesssim|x-y|\leqslant\delta$ . Therefore

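$$a(x)|\nabla u_{\delta}|\lesssim\int_{\mathbb{R}^{n}}\delta\,|u(y)\nabla\eta_{\delta}(x-y)|\,dy\lesssim\frac{1}{|B(x,\delta)|}\int_{B(x,\delta)}|u(y)|\,dy\leqslant Mu(x),$$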

where we used that $|\delta\nabla\eta_{\delta}|\lesssim\delta^{-n}\chi_{B(x,\delta)}$ for the middle step. Again, since $u\in L^{2}(\Omega)$ , we obtain an upper bound independent of $\delta$ in the space $L^{2}(\Omega)$ . In the set $\{a>0\}$ we have $\nabla u_{\delta}\to\nabla u$ almost everywhere. Thus it follows by dominated convergence in $L^{2}(\Omega)$ that

$$\lim_{\delta\to 0}\int_{F}(a(x)|\nabla u_{\delta}|)^{2}\,dx=\int_{F}(a(x)|\nabla u|)^{2}\,dx.$$

We have so far shown that

$$\limsup_{\delta\to 0}\operatorname{\mathcal{I}}(u_{\delta},F)\leqslant\operatorname{\mathcal{I}}(u,F).$$

It remains to change the first functional from $\operatorname{\mathcal{I}}$ to $\operatorname{\mathcal{I}}_{\varepsilon_{i}}$ . Equation (2.3) implies $|\nabla u_{\delta}|\leqslant\frac{c}{\delta^{n}}$ , where $c$ depends on $|Du|(\Omega)$ . Therefore

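$$\int_{F}|\nabla u_{\delta}|^{1+\varepsilon}+(\varepsilon+a(x)^{2})|\nabla u_{\delta}|^{2}\,dx\leqslant\Big(\frac{c}{\delta^{n}}\Big)^{\varepsilon}|Du_{\delta}|(F)+\int_{F}(a(x)|\nabla u_{\delta}|)^{2}\,dx+\varepsilon\Big(\frac{c}{\delta^{n}}\Big)^{2}|F|.$$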

We choose $\delta_{i}:=\varepsilon_{i}^{1/(3n)}$ so that $(\tfrac{c}{\delta_{i}^{n}})^{\varepsilon_{i}}\to 1$ and $\varepsilon_{i}(\tfrac{c}{\delta_{i}^{n}})^{2}\to 0$ and set $u_{i}:=u_{\delta_{i}}$ . Then

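$$
\begin{aligned}
&\limsup_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},F)\\
&\qquad\leqslant\limsup_{i\to\infty}\Big[\big(\tfrac{c}{\delta_{i}^{n}}\big)^{\varepsilon_{i}}|Du_{i}|(F)+\int_{F}(a(x)|\nabla u_{i}|)^{2}+|u_{i}-f|^{2}\,dx+\varepsilon_{i}\big(\tfrac{c}{\delta_{i}^{n}}\big)^{2}|F|\Big]\\
&\qquad=\limsup_{i\to\infty}\Big[|Du_{i}|(F)+\int_{F}(a(x)|\nabla u_{i}|)^{2}+|u_{i}-f|^{2}\,dx\Big]\leqslant\operatorname{\mathcal{I}}(u,F).
\end{aligned}
$$

∎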
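The coupling $\delta_{i}=\varepsilon_{i}^{1/(3n)}$ used in the proof of Lemma 3.1 can be checked numerically. The sketch below evaluates the two factors $(c/\delta^{n})^{\varepsilon}$ and $\varepsilon(c/\delta^{n})^{2}$ for decreasing $\varepsilon$; the function name and the value `c = 10.0` are assumptions chosen only for illustration, since the proof fixes $c$ only up to its dependence on $|Du|(\Omega)$.

```python
def regularization_factors(eps, n=2, c=10.0):
    """Return ((c/delta^n)^eps, eps*(c/delta^n)^2) for delta = eps^(1/(3n)).

    c is a placeholder for the constant in |grad u_delta| <= c / delta^n.
    """
    delta = eps ** (1.0 / (3 * n))
    bound = c / delta ** n  # equals c * eps^(-1/3)
    return bound ** eps, eps * bound ** 2


for eps in (1e-3, 1e-6, 1e-12):
    first, second = regularization_factors(eps)
    print(f"eps={eps:.0e}: (c/delta^n)^eps={first:.8f}, eps*(c/delta^n)^2={second:.6f}")
```

With this choice $c/\delta^{n}=c\,\varepsilon^{-1/3}$, so the second factor equals $c^{2}\varepsilon^{1/3}$ and the logarithm of the first is $\varepsilon\log(c\,\varepsilon^{-1/3})$; both limits follow, and the printed values should reflect this trend.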

Remark 3.3.

From the previous proof we can see that the exact condition used for $a$ is not $C^{0,1}(\Omega)$ , but rather the inequality

$$a(x)\lesssim\max\{|x-y|,a(y)\}\quad\text{for all }x,y\in\Omega.$$

This means that we could replace $a(x)^{2}$ in the double phase functional with $a(x)^{q}$ for $a\in C^{0,\alpha}(\Omega)$ as long as $q\alpha\geqslant 2$ . This kind of condition was first identified for the double phase functional in [29, Section 7.2].

With the method of the previous proof, one can obtain from (3.2) that $a(x)|\nabla u_{\delta}|$ is bounded by $M_{\alpha}(Du)$ when $a\in C^{0,\alpha}(\Omega)$ and $M_{\alpha}$ denotes the fractional maximal operator (cf. Lemma 3.5). This will allow us to prove the result for bounded functions $u$ with a larger class of weights $a$ . A number of recent studies, e.g. [11, 12], deal with the question of the Sobolev regularity of the maximal function $M_{\alpha}u$ of a Sobolev or $BV$ function $u$ . However, we have not found any results on the maximal function of the derivative of a $BV$ function. Therefore, the following result may be of independent interest.

Proposition 3.4.

Let $\mu$ be a vector Borel measure in $\Omega$ with finite total variation $|\mu|(\Omega)<\infty$, $\sigma\in(0,n)$ and $\alpha\in(0,n-\sigma)$. Then the capped fractional maximal function

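$$M_{\alpha}^{\sigma}\mu(x):=\sup_{r\leqslant\operatorname{diam}\Omega}\frac{\min\{|\mu|(B(x,r)),r^{\sigma}\}}{|B(x,r)|^{1-\frac{\alpha}{n}}}$$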

belongs to $L^{p}(\Omega)$ if $p<1+\frac{\alpha}{n-\sigma-\alpha}$ .

Furthermore, the bound is sharp since the claim does not hold for $p\geqslant 1+\frac{\alpha}{n-\sigma-\alpha}$ .
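To get a feeling for the capped fractional maximal function, the sketch below evaluates $\sup_{r\leqslant\operatorname{diam}\Omega}\min\{|\mu|(B(x,r)),r^{\sigma}\}/|B(x,r)|^{1-\alpha/n}$ for a finite sum of point masses. This is a toy setting chosen for the example, with scalar masses standing in for the vector measure; the function names and the `atoms` data layout are assumptions. For such a measure the quotient is non-increasing in $r$ between consecutive atom distances (because $\sigma<n-\alpha$), so the supremum is approached as $r$ decreases to the distance of an atom.

```python
import math


def ball_volume(r, n):
    """Lebesgue measure of a ball of radius r in R^n."""
    return math.pi ** (n / 2) / math.gamma(n / 2 + 1) * r ** n


def capped_fractional_maximal(x, atoms, alpha, sigma, diam):
    """Capped fractional maximal function at x for mu = sum of point masses.

    atoms is a list of (point, mass) pairs; open balls B(x, r) with r <= diam
    are used, so an atom at distance d only counts for radii r > d.
    """
    n = len(x)
    if not (0 < sigma < n and 0 < alpha < n - sigma):
        raise ValueError("need 0 < sigma < n and 0 < alpha < n - sigma")
    best, mass = 0.0, 0.0
    for d, m in sorted((math.dist(x, p), abs(m)) for p, m in atoms if m != 0):
        if d >= diam:
            break  # no admissible open ball reaches this atom
        if d == 0.0:
            return math.inf  # quotient behaves like r^(sigma - n + alpha) as r -> 0
        mass += m
        best = max(best, min(mass, d ** sigma) / ball_volume(d, n) ** (1 - alpha / n))
    return best


# One atom of mass 5 at distance 1 in the plane: the cap r^sigma = 1 is active.
value = capped_fractional_maximal((0.0, 0.0), [((1.0, 0.0), 5.0)], alpha=0.5, sigma=1.0, diam=2.0)
```

In the sample call the cap replaces the mass $5$ by $r^{\sigma}=1$, which is the mechanism that keeps the maximal function integrable for larger exponents than an uncapped version would allow.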

Proof.

We consider dyadic cubes intersecting $\Omega$ with side-length at most $\mathop{diam}\nolimits\Omega$ . Specifically, we assume that the cubes are of the form $[a_{1},b_{1})\times\cdots\times[a_{n},b_{n})$ and denote by $\mathcal{D}_{k}$ the set of such cubes with side-length $2^{k}$ . Let $D^{x}_{k}\in{\mathcal{D}}_{k}$ be the cube which contains $x$ and $3D^{x}_{k}$ be its threefold dilate. We define $\mu_{k}(A):=\min\{|\mu|(A\cap\Omega),2^{\sigma k}\}$ . If $2^{k-1}\leqslant r<2^{k}$ , then $B(x,r)\subset 3D^{x}_{k}$ . Thus

$$M_{\alpha}^{\sigma}\mu(x)\lesssim\sup_{k\in K_{0}}\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}},$$

where $K_{0}:=\{-\infty,\ldots,k_{0}\}$ and $k_{0}$ is the smallest integer with $2^{k_{0}}>\mathop{diam}\nolimits\Omega$ . We raise this to the power $p$ and estimate the supremum by a sum:

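$$M_{\alpha}^{\sigma}\mu(x)^{p}\lesssim\sup_{k\in K_{0}}\Big(\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}}\Big)^{p}\leqslant\sum_{k\in K_{0}}\Big(\frac{\mu_{k}(3D^{x}_{k})}{2^{(n-\alpha)k}}\Big)^{p}.$$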

Next we integrate over $\Omega$ and use that $\mu_{k}(3D^{x}_{k})$ can be estimated by the sum of $3^{n}$ terms of the form $\mu_{k}(D_{k})$ with $D_{k}\in{\mathcal{D}}_{k}$ . Thus we obtain that

[TABLE]

Let us maximize the sum $\sum_{D\in{\mathcal{D}}_{k}}\mu_{k}(D)^{p}$ separately for each $k$ . Since ${\mathcal{D}}_{k}\cap\Omega$ is a partition of $\Omega$ , we can write this optimization problem as

[TABLE]

where $a_{i}=\mu_{k}(D_{i})$ for $D_{i}\in{\mathcal{D}}_{k}$; the last restriction holds since $\mu_{k}(\Omega)\leqslant 2^{\sigma k}$ by the definition of $\mu_{k}$. We consider what values of the $a_{i}$'s lead to a maximally large sum. If $0<a_{i}<a_{j}<2^{\sigma k}$, then

[TABLE]

for $0<t<\min\{a_{i},2^{\sigma k}-a_{j}\}$ . Therefore the sum is maximized subject to the constraints when $a_{i}=2^{\sigma k}$ for as many indices as possible and zero for the rest. There are no more than $\lceil 2^{-\sigma k}|\mu|(\Omega)\rceil$ such maximal indices. Thus

[TABLE]

We use this estimate in our previous inequality, and conclude that

[TABLE]

The last sum is finite if $-(n-\alpha)p+(p-1)\sigma+n>0$ , which is equivalent to the condition in the proposition.
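The equivalence between the summability condition and the exponent bound is a short computation. As a worked step, using only $n-\sigma-\alpha>0$ from the hypothesis $\alpha\in(0,n-\sigma)$:

```latex
\begin{aligned}
-(n-\alpha)p+(p-1)\sigma+n>0
  &\iff n-\sigma>p\,(n-\sigma-\alpha)\\
  &\iff p<\frac{n-\sigma}{n-\sigma-\alpha}=1+\frac{\alpha}{n-\sigma-\alpha}.
\end{aligned}
```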

It remains to prove sharpness. For simplicity we consider only the case when $\sigma$ is an integer. We let $E$ be a $\sigma$ -dimensional plane and define $\mu(A):={\mathcal{H}}^{\sigma}(E\cap A)$ . Denote $d(x):=\mathop{dist}\nolimits(x,E)$ . Then

[TABLE]

We raise this to the power $p$ and integrate over $x$ :

[TABLE]

This integral diverges if $(\sigma-n+\alpha)p+n-\sigma\leqslant 0$ , which gives the claimed bound for $p$ . In the case of non-integer $\sigma$ , we instead choose our set as the Cartesian product of a plane and a Cantor set, and estimate as before. ∎

With the fractional maximal operator we can extend Lemma 3.1 in the case of bounded functions. Bounded functions are very natural in the context of image processing, since the grey-scale values are usually taken in some compact interval such as $[0,255]$ or $[0,1]$ . Note that to use the previous proposition, we cannot directly move to the total variation measure $|Du|$ , since this is not in general going to satisfy the appropriate decay $r^{n-1}$ when $u$ is bounded. Rather, we have to first estimate the absolute value of the measure of a ball, $|Du(B(x,r))|$ , and only afterward move to $|Du|$ . In the next result we therefore work with the vector measure $Du$ rather than its total variation, which makes the estimates slightly more difficult.

Lemma 3.5.

Let $F\subset\Omega$ be closed and $a\in C^{0,\alpha}(\Omega)$ for some $\alpha>\frac{1}{2}$ . For $\varepsilon_{i}\to 0^{+}$ and $u\in BV^{1,2}_{a}(\Omega)\cap L^{\infty}(\Omega)$ , there exist $u_{i}\in W^{1,2}(U)\cap L^{\infty}(\Omega)$ in a neighborhood $U$ of $F$ such that

[TABLE]

Proof.

The proof is identical to that of Lemma 3.1, except for the estimate of $a(x)|\nabla u_{\delta}|$ in the second case, $a(x)>2a(y)$ for some $y\in B(x,\delta)$. Let us show that we can use Proposition 3.4 to handle this case. By the construction of the measure $Du$,

[TABLE]

for all $\varphi\in C^{1}_{0}(B(x,r);{\mathbb{R}^{n}})$ , cf. [3, Proposition 3.6]. We choose $\varphi(y)=b\xi(|x-y|)$ where $b\in B(0,1)$ and $\xi\in C^{1}([0,\infty))$ with $\xi|_{[0,r-\varepsilon-\varepsilon^{2}]}=1$ , $\xi|_{[r-\varepsilon^{2},\infty)}=0$ and $|\xi^{\prime}|\leqslant\frac{2}{\varepsilon}$ . Then $|\mathop{div}\nolimits\varphi|\leqslant\frac{2}{\varepsilon}\chi_{B(x,r-\varepsilon^{2})\setminus B(x,r-\varepsilon-\varepsilon^{2})}$ and so

[TABLE]

since $u$ is bounded. It follows by monotone convergence as $\varepsilon\to 0^{+}$ that

[TABLE]

Therefore, $|Du(B(x,r))|\lesssim\min\{|Du|(B(x,r)),r^{n-1}\}$ and so

[TABLE]

On the other hand, we can estimate for the derivative of the convolution using (2.3), the distribution function of $Du$ [42, Theorem 8.16] and the estimate $|\frac{d}{dr}\eta_{\delta}(re_{1})|\lesssim\delta^{-n-1}$ . For a unit vector $e_{1}$ , it follows that

[TABLE]

As in Lemma 3.1, we conclude now from $a\in C^{0,\alpha}(\Omega)$ in the second case that $a(x)\lesssim\delta^{\alpha}$. Thus $a(x)|\nabla u_{\delta}|\lesssim M_{\alpha}^{n-1}(Du)(x)$. By Proposition 3.4 with $\sigma=n-1$ and $p=2$, the right-hand side is in $L^{2}(\Omega)$ provided $2<1+\frac{\alpha}{n-(n-1)-\alpha}=\frac{1}{1-\alpha}$, which holds since $\alpha>\frac{1}{2}$. Thus we can use this as the bound for dominated convergence. The rest of the proof is as before. ∎
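The exponent condition at the end of this proof can be tabulated with a small helper. The sketch below computes the critical exponent $1+\frac{\alpha}{n-\sigma-\alpha}$ from Proposition 3.4 and checks whether $p=2$ lies strictly below it when $\sigma=n-1$; the function name `critical_exponent` is an assumption introduced for this example.

```python
def critical_exponent(n, sigma, alpha):
    """Upper bound 1 + alpha / (n - sigma - alpha) for the integrability exponent p."""
    if not (0 < sigma < n and 0 < alpha < n - sigma):
        raise ValueError("need 0 < sigma < n and 0 < alpha < n - sigma")
    return 1 + alpha / (n - sigma - alpha)


def square_integrable(n, alpha):
    """True if p = 2 is admissible for sigma = n - 1, i.e. 2 < 1 / (1 - alpha)."""
    return 2 < critical_exponent(n, n - 1, alpha)


# For sigma = n - 1 the bound equals 1 / (1 - alpha), independently of n.
bound_plane = critical_exponent(2, 1, 0.75)
bound_space = critical_exponent(3, 2, 0.75)
borderline = square_integrable(2, 0.5)
```

With $\sigma=n-1$ the dimension drops out, and the borderline value $\alpha=\frac{1}{2}$ gives a critical exponent of exactly $2$, which is excluded by the strict inequality in Proposition 3.4.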

Remark 3.6.

If we consider a double phase functional $t^{p}+a(x)t^{q}$ in “normal” form, then the condition from the previous results can be written $q<p+\alpha$ . This condition has proved to be of central importance when considering bounded solutions, cf. [6, 16, 32]. In this sense, the assumption in Lemma 3.5 is probably essentially sharp.

However, more precise research has established that one may even take $q\leqslant p+\alpha$ for bounded minimizers [6, 33] (see also [21, 34] for the borderline case with unbounded minimizers). The borderline is handled using additional Hölder continuity obtained via the De Giorgi technique, which in this case implies that $u\in C^{0,\gamma}(\Omega)$ for some $\gamma>0$. Indeed, from the previous proof we can see that $a\in C^{0,1/2}(\Omega)$ would suffice if we had $u\in C^{0,\gamma}(\Omega)$ for some $\gamma>0$ (as one has when $p,q>1$) instead of $u\in L^{\infty}(\Omega)$. However, for $BV$ problems, such higher regularity of the function cannot be expected. Therefore, the borderline $q=p+\alpha$ remains a problem for future research.

Let us also note that Ok [40] has considered double phase functionals under additional a priori integrability assumptions other than $L^{\infty}(\Omega)$. If one could prove decay estimates $|Du(B(x,r))|\lesssim r^{\sigma}$ for $\sigma\in(n-1,n)$ when $u\in L^{s}(\Omega)$, we could cover this case as well. We do not know of any such results, so this likewise remains a topic for another study.

4. Upper estimates for the $BV$ double phase functional

The concept of $\Gamma$ -convergence, introduced by De Giorgi and Franzoni [23], has been systematically presented in [8, 19]. A family of functionals $\operatorname{\mathcal{I}}_{\varepsilon}:X\to\overline{\mathbb{R}}$ is said to $\Gamma$ -converge (in topology $\tau$ ) to $\operatorname{\mathcal{I}}:X\to\overline{\mathbb{R}}$ if the following hold for every positive sequence $(\varepsilon_{i})$ converging to zero:

(a) $\displaystyle\operatorname{\mathcal{I}}(u)\leqslant\liminf_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i})$ for every $u\in X$ and every $(u_{i})\subset X$ $\tau$-converging to $u$;

(b) $\displaystyle\operatorname{\mathcal{I}}(u)\geqslant\limsup_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i})$ for every $u\in X$ and some $(u_{i})\subset X$ $\tau$-converging to $u$.

Let us remark that the somewhat strange assumption ${\mathcal{H}}^{n-1}(\{a=0\}\cap\partial\Omega)=0$ in the next theorem is actually quite natural: since $\{a=0\}$ is the set where the image edges occur, we cannot identify the edge if it coincides with the image boundary $\partial\Omega$ . On the other hand, we also have no need for the jump in the function at this location, since the other part of the jump will be outside the image, and thus cannot be seen.

Theorem 4.1.

Suppose that $\Omega$ is a rectangular cuboid, $a\in C^{0,1}(\overline{\Omega})$ , and assume that $a>0$ ${\mathcal{H}}^{n-1}$ -a.e. on the boundary $\partial\Omega$ . Then $\operatorname{\mathcal{I}}_{\varepsilon}$ $\Gamma$ -converges to $\operatorname{\mathcal{I}}$ in $L^{1}(\Omega)$ topology with $X:=BV^{1,2}_{a}(\Omega)$ .

Proof.

Let us start with condition (a) in the definition of $\Gamma$ -convergence. Let $(\varepsilon_{i})$ be a positive sequence converging to zero. Let $u\in BV^{1,2}_{a}(\Omega)$ and let $(u_{i})\subset BV_{a}^{1,2}(\Omega)$ be a sequence converging to $u$ in $L^{1}(\Omega)$ . If $\liminf_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i})=\infty$ , then there is nothing to prove, so we assume that $K:=\liminf_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i})<\infty$ . We restrict our attention to a subsequence with $\lim_{i\to\infty}\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i})=K$ and $u_{i}\in W^{1,2}(\Omega)$ . Then $(u_{i})$ is a bounded sequence in $BV^{1,2}_{a}(\Omega)$ . By precompactness of $BV$ there exists a limit function for a subsequence such that $|Du^{b}|(\Omega)\leqslant\liminf|Du_{i}|(\Omega)$ ; by reflexivity of $W^{1,2}_{a}(\Omega)$ and $L^{2}(\Omega)$ , we obtain subsequences with $\nabla u_{i}\rightharpoonup\nabla u^{w}$ , $u_{i}\rightharpoonup u^{w}$ in $L^{2}_{a}(\Omega)$ and $u_{i}-f\rightharpoonup u^{l}-f\ \text{in }L^{2}(\Omega)$ . By $u_{i}\to u$ in $L^{1}(\Omega)$ and the uniqueness of the limit, we conclude that $u^{b}=u^{w}=u^{l}=u$ .

The weak lower semi-continuity of the Lebesgue integral yields that

[TABLE]

and, since $\varepsilon_{i}\geqslant 0$ ,

[TABLE]

Finally, for the $BV$ part we use the estimate from the previous paragraph, Young’s inequality and $(\frac{1}{1+\varepsilon_{i}})^{1/{\varepsilon_{i}}}\to\frac{1}{e}$ :

[TABLE]

By combining the above inequalities we obtain condition (a). Note that for this part we do not need the assumptions on $\Omega$ and $a$ .

Let us then move to condition (b). Since $\Omega$ is a rectangular cuboid, we can extend both the function $u$ and the weight $a$ by reflections to the rectangular cuboid with the same center but $3$ times the side-lengths. Then we use Lemma 3.1 with $F:=\overline{\Omega}$ to conclude that there exist $u_{i}\in W^{1,2}(U)$ such that

[TABLE]

We need this inequality with $\Omega$ instead of $\overline{\Omega}$ . Since $|\partial\Omega|=0$ and $u_{i}$ is a Sobolev function, $\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},\overline{\Omega})=\operatorname{\mathcal{I}}_{\varepsilon_{i}}(u_{i},\Omega)$ . On the right-hand side, the same reason implies that

[TABLE]

The singular set of $Du$ is contained in $\{a=0\}$ because $u\in W^{1,2}_{a}(U)$ . Since $\{a=0\}\cap\partial\Omega$ has Hausdorff $(n-1)$ -measure zero by assumption, it follows by the decomposition (2.1) that $|Du|(\partial\Omega)=0$ and so $|Du|(\overline{\Omega})=|Du|(\Omega)$ . Thus we have established condition (b) of $\Gamma$ -convergence. ∎

In the previous theorem we could consider a Lipschitz domain instead of a rectangular cuboid. In this case, the extension of both $u$ and $a$ would be done by flattening the boundary with the Lipschitz map. If we use Lemma 3.5 instead of Lemma 3.1, we obtain the following variant.

Theorem 4.2.

Suppose that $\Omega$ is a bounded Lipschitz domain, $a\in C^{0,\alpha}(\overline{\Omega})$ for some $\alpha>\frac{1}{2}$ , and assume that $a>0$ ${\mathcal{H}}^{n-1}$ -a.e. on the boundary $\partial\Omega$ . Then $\operatorname{\mathcal{I}}_{\varepsilon}$ $\Gamma$ -converges to $\operatorname{\mathcal{I}}$ in $L^{1}(\Omega)$ topology with $X:=BV^{1,2}_{a}(\Omega)\cap L^{\infty}(\Omega)$ .

We use the following formulation for relaxation, which emphasizes the connection with $\Gamma$ -convergence. A functional $\overline{\operatorname{\mathcal{J}}}:X\to\overline{\mathbb{R}}$ is the relaxation of $\operatorname{\mathcal{J}}:X\to\overline{\mathbb{R}}$ in topology $\tau$ if

(a) $\displaystyle\overline{\operatorname{\mathcal{J}}}(u)\leqslant\liminf_{i\to\infty}\operatorname{\mathcal{J}}(u_{i})$ for every $u\in X$ and every $(u_{i})\subset X$ $\tau$ -converging to $u$ ;

(b) $\displaystyle\overline{\operatorname{\mathcal{J}}}(u)\geqslant\limsup_{i\to\infty}\operatorname{\mathcal{J}}(u_{i})$ for every $u\in X$ and some $(u_{i})\subset X$ $\tau$ -converging to $u$ .

The relaxation is the greatest lower semicontinuous minorant of $\operatorname{\mathcal{J}}$ ; see [8, Proposition 1.31, p. 33]. For $u\in BV(\Omega)$ let us write

[TABLE]

We show that the relaxation $\overline{\operatorname{\mathcal{J}}}$ of this functional equals $\operatorname{\mathcal{I}}$ . The proof is identical to that of Theorem 4.1: we simply take $\operatorname{\mathcal{I}}_{\varepsilon}=\operatorname{\mathcal{J}}$ for every $\varepsilon>0$ and $\operatorname{\mathcal{I}}=\overline{\operatorname{\mathcal{J}}}$ . Naturally, we could also prove an analogue of Theorem 4.2.

Corollary 4.3.

Suppose that $\Omega$ is a rectangular cuboid, $a\in C^{0,1}(\overline{\Omega})$ , and assume that $a>0$ ${\mathcal{H}}^{n-1}$ -a.e. on the boundary $\partial\Omega$ . Then $\overline{\operatorname{\mathcal{J}}}=\operatorname{\mathcal{I}}$ in $L^{1}(\Omega)$ topology.

Acknowledgement

We thank the referee for some comments regarding this manuscript.

Bibliography


[1] M.K. Alaouia, T. Nabilab and M. Altanjia: On some new non-linear diffusion models for the image filtering, Applicable Anal. 93 (2014), no. 2, 269–280.
[2] L. Ambrosio: Metric space valued functions of bounded variation, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 17 (1990), no. 3, 439–478.
[3] L. Ambrosio, N. Fusco and D. Pallara: Functions of Bounded Variation and Free Discontinuity Problems, Oxford Mathematical Monographs, Clarendon Press, Oxford University Press, New York, 2000.
[4] G. Aubert and P. Kornprobst: Mathematical problems in image processing, Partial differential equations and the calculus of variations, Second edition, Applied Mathematical Sciences, vol. 147, Springer, New York, 2006.
[5] P. Baroni, M. Colombo and G. Mingione: Harnack inequalities for double phase functionals, Nonlinear Anal. 121 (2015), 206–222.
[6] P. Baroni, M. Colombo and G. Mingione: Regularity for general functionals with double phase, Calc. Var. Partial Differential Equations 57 (2018), no. 2, paper no. 62, 48 pp.
[7] A. Benyaiche and I. Khlifi: Harnack inequality for quasilinear elliptic equations in generalized Orlicz-Sobolev spaces, Potential Anal., to appear, DOI:10.1007/s11118-019-09781-z.
[8] A. Braides: $\Gamma$-convergence for beginners, Oxford Lecture Series in Mathematics and its Applications, vol. 22, Oxford University Press, Oxford, 2002, xii+218 pp.
