Quasi-Herglotz functions and convex optimization

Yevhen Ivanenko; Mitja Nedic; Mats Gustafsson; B. L. G. Jonsson,; Annemarie Luger; Sven Nordebo

arXiv:1812.08319·math.NA·February 17, 2021

Quasi-Herglotz functions and convex optimization

Yevhen Ivanenko, Mitja Nedic, Mats Gustafsson, B. L. G. Jonsson,, Annemarie Luger, Sven Nordebo

PDF

TL;DR

This paper introduces quasi-Herglotz functions, extending Herglotz functions, and demonstrates their usefulness in modeling non-passive systems through convex optimization and numerical examples.

Contribution

It defines the set of quasi-Herglotz functions, explores their properties, and applies them to model non-passive media via convex optimization techniques.

Findings

01

Quasi-Herglotz functions form a linear space extending Herglotz functions.

02

Properties like integral representations and boundary values are inherited.

03

Numerical examples show effective modeling of non-passive gain media.

Abstract

We introduce the set of quasi-Herglotz functions and demonstrate that it has properties useful in the modeling of non-passive systems. The linear space of quasi-Herglotz functions constitutes a natural extension of the convex cone of Herglotz functions. It consists of differences of Herglotz functions, and we show that several of the important properties and modeling perspectives are inherited by the new set of quasi-Herglotz functions. In particular, this applies to their integral representations, the associated integral identities or sum rules (with adequate additional assumptions), their boundary values on the real axis and the associated approximation theory. Numerical examples are included to demonstrate the modeling of a non-passive gain media formulated as a convex optimization problem, where the generating measure is modeled by using a finite expansion of B-splines and point…

Equations75

C^{+} := {z := x + i y \in C \leavevmode ∣ \leavevmode y > 0}

C^{+} := {z := x + i y \in C \leavevmode ∣ \leavevmode y > 0}

h (z) = - h (- z^{*})^{*},

h (z) = - h (- z^{*})^{*},

h (z) = a_{+} + b_{+} z + \int_{R} \frac{1 + ξ z}{ξ - z} d σ_{+} (ξ),

h (z) = a_{+} + b_{+} z + \int_{R} \frac{1 + ξ z}{ξ - z} d σ_{+} (ξ),

h (z) = b_{+} z + p.v. \int_{R} \frac{1 + ξ ^{2}}{ξ - z} d σ_{+} (ξ),

h (z) = b_{+} z + p.v. \int_{R} \frac{1 + ξ ^{2}}{ξ - z} d σ_{+} (ξ),

q (z) = h_{1} (z) - h_{2} (z)

q (z) = h_{1} (z) - h_{2} (z)

q (z) = (h_{1} + h_{3}) (z) - (h_{2} + h_{3}) (z),

q (z) = (h_{1} + h_{3}) (z) - (h_{2} + h_{3}) (z),

q (z) = a + b z + \int_{R} \frac{1 + ξ z}{ξ - z} d σ (ξ),

q (z) = a + b z + \int_{R} \frac{1 + ξ z}{ξ - z} d σ (ξ),

q (z) = b z + p.v. \int_{R} \frac{1 + ξ ^{2}}{ξ - z} d σ (ξ),

q (z) = b z + p.v. \int_{R} \frac{1 + ξ ^{2}}{ξ - z} d σ (ξ),

a_{+} + b_{+} z + \int_{R} (\frac{1}{ξ - z} - \frac{ξ}{1 + ξ ^{2}}) d β_{+} (ξ),

a_{+} + b_{+} z + \int_{R} (\frac{1}{ξ - z} - \frac{ξ}{1 + ξ ^{2}}) d β_{+} (ξ),

\int_{R} \frac{1}{1 + ξ ^{2}} d β_{+} (ξ) < \infty.

\int_{R} \frac{1}{1 + ξ ^{2}} d β_{+} (ξ) < \infty.

q (z) = a + b z + \int_{R} (\frac{1}{ξ - z} - \frac{ξ}{1 + ξ ^{2}}) d β (ξ),

q (z) = a + b z + \int_{R} (\frac{1}{ξ - z} - \frac{ξ}{1 + ξ ^{2}}) d β (ξ),

C^{0, α} (Ω) \subset C (Ω) \subset L^{p} (w, Ω),

C^{0, α} (Ω) \subset C (Ω) \subset L^{p} (w, Ω),

q (x) = a + b x + p.v. \int_{R} \frac{1 + ξ x}{ξ - x} d σ (ξ) + i π (1 + x^{2}) σ_{Ω}^{'} (x),

q (x) = a + b x + p.v. \int_{R} \frac{1 + ξ x}{ξ - x} d σ (ξ) + i π (1 + x^{2}) σ_{Ω}^{'} (x),

{z \in C^{+} \leavevmode ∣ \leavevmode θ < Arg (z) < π - θ} .

{z \in C^{+} \leavevmode ∣ \leavevmode θ < Arg (z) < π - θ} .

q (z) = \frac{a _{- 1}}{z} + a_{0} + a_{1} z + \dots + a_{M} z^{M} + o (z^{M}) as z \vec{^} 0.

q (z) = \frac{a _{- 1}}{z} + a_{0} + a_{1} z + \dots + a_{M} z^{M} + o (z^{M}) as z \vec{^} 0.

q(z)=b_{1}z+b_{0}+\frac{b_{-1}}{z}+\ldots+\frac{b_{-K}}{z^{K}}+o\Big{(}\frac{1}{z^{K}}\Big{)}\quad\quad{\rm as}\quad z\hat{\to}\infty.

q(z)=b_{1}z+b_{0}+\frac{b_{-1}}{z}+\ldots+\frac{b_{-K}}{z^{K}}+o\Big{(}\frac{1}{z^{K}}\Big{)}\quad\quad{\rm as}\quad z\hat{\to}\infty.

z \vec{^} 0 lim z q (z) = - σ ({0}),

z \vec{^} 0 lim z q (z) = - σ ({0}),

z \vec{^} \infty lim \frac{q ( z )}{z} = b,

z \vec{^} \infty lim \frac{q ( z )}{z} = b,

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{- 2 N_{0}} Im {q (x + i y)} d x

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{- 2 N_{0}} Im {q (x + i y)} d x

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{2 N_{\infty}} Im {q (x + i y)} d x

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{2 N_{\infty}} Im {q (x + i y)} d x

\lim_{\varepsilon\to 0^{+}}\lim_{y\to 0^{+}}\frac{1}{\pi}\int_{\varepsilon<|x|<\frac{1}{\varepsilon}}x^{k}\mathrm{Im}\{q(x+\mathrm{i}y)\}\mathrm{d}x=\left\{\begin{array}[]{ll}a_{-k-1},&-2N_{0}\leq k\leq-3,\\ a_{-k-1}-b_{-k-1},&-2\leq k\leq 0,\\ -b_{-k-1},&1\leq k\leq 2N_{\infty}\end{array}\right.

\lim_{\varepsilon\to 0^{+}}\lim_{y\to 0^{+}}\frac{1}{\pi}\int_{\varepsilon<|x|<\frac{1}{\varepsilon}}x^{k}\mathrm{Im}\{q(x+\mathrm{i}y)\}\mathrm{d}x=\left\{\begin{array}[]{ll}a_{-k-1},&-2N_{0}\leq k\leq-3,\\ a_{-k-1}-b_{-k-1},&-2\leq k\leq 0,\\ -b_{-k-1},&1\leq k\leq 2N_{\infty}\end{array}\right.

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{- 2 N_{0}} Im {h_{1} (x + i y)} d x .

ε \to 0^{+} lim y \to 0^{+} lim ε < ∣ x ∣ < \frac{1}{ε} \int x^{- 2 N_{0}} Im {h_{1} (x + i y)} d x .

ε \to 0^{+} lim y \to 0^{+} lim \frac{2}{π} \int_{ε}^{ε^{- 1}} x^{k} Im {q (x + i y)} d x .

ε \to 0^{+} lim y \to 0^{+} lim \frac{2}{π} \int_{ε}^{ε^{- 1}} x^{k} Im {q (x + i y)} d x .

d := q \in W^{α, p} (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

d := q \in W^{α, p} (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

d = q \in W^{m, p} (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

d = q \in W^{m, p} (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

d = q \in W (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

d = q \in W (w, Ω) in f ∥ q - F ∥_{L^{p} (w, Ω)} .

q (x) = a + b x + i = 1 \sum M \frac{p _{i}}{ξ _{i} - x} + p.v. \int_{- \infty}^{\infty} (\frac{1}{ξ - x} - \frac{ξ}{1 + ξ ^{2}}) β^{'} (ξ) d ξ + i π β^{'} (x)

q (x) = a + b x + i = 1 \sum M \frac{p _{i}}{ξ _{i} - x} + p.v. \int_{- \infty}^{\infty} (\frac{1}{ξ - x} - \frac{ξ}{1 + ξ ^{2}}) β^{'} (ξ) d ξ + i π β^{'} (x)

= \overset{a}{ˇ} + b x + i = 1 \sum M \frac{p _{i}}{ξ _{i} - x} + p.v. \int_{- \infty}^{\infty} \frac{1}{ξ - x} β^{'} (ξ) d ξ + i π β^{'} (x)

Im {q_{N} (x)} = π β^{'} (x) = n = 1 \sum N c_{n} p_{n} (x),

Im {q_{N} (x)} = π β^{'} (x) = n = 1 \sum N c_{n} p_{n} (x),

Re {q_{N} (x)} = \overset{a}{ˇ} + b x + i = 1 \sum M \frac{p _{i}}{ξ _{i} - x} + n = 1 \sum N c_{n} \overset{p}{^}_{n} (x),

Re {q_{N} (x)} = \overset{a}{ˇ} + b x + i = 1 \sum M \frac{p _{i}}{ξ _{i} - x} + n = 1 \sum N c_{n} \overset{p}{^}_{n} (x),

\begin{array}[]{llll}&\mathrm{minimize}&&\|q-F\|_{\mathrm{L}^{p}(w,\Omega)}\\[2.84526pt] &\mathrm{subject\ to}&&b_{\mathrm{lower}}(x)\leq\beta^{\prime}(x)\leq b_{\mathrm{upper}}(x),\end{array}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\xpatchcmd

Quasi-Herglotz functions and convex optimization

Y. Ivanenko1, M. Nedic2, M. Gustafsson3, B. L. G. Jonsson4, A. Luger2, S. Nordebo1

1 Department of Physics and Electrical Engineering, Linnæus University, 351 95 Växjö, Sweden. E-mail: {yevhen.ivanenko,sven.nordebo}@lnu.se.

2 Department of Mathematics, Stockholm University 106 91 Stockholm, Sweden. E-mail: {mitja,luger}@math.su.se.

3 Department of Electrical and Information Technology, Lund University, Box 118, 221 00 Lund, Sweden. E-mail: [email protected].

4 School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, 100 44 Stockholm, Sweden. E-mail: [email protected].

Abstract

We introduce the set of quasi-Herglotz functions and demonstrate that it has properties useful in the modeling of non-passive systems. The linear space of quasi-Herglotz functions constitutes a natural extension of the convex cone of Herglotz functions. It consists of differences of Herglotz functions and we show that several of the important properties and modeling perspectives are inherited by the new set of quasi-Herglotz functions. In particular, this applies to their integral representations, the associated integral identities or sum rules (with adequate additional assumptions), their boundary values on the real axis and the associated approximation theory. Numerical examples are included to demonstrate the modeling of a non-passive gain media formulated as a convex optimization problem, where the generating measure is modeled by using a finite expansion of B-splines and point masses.

1 Introduction

It is well known that an admittance passive system (admittance, impedance, electromagnetic constitutive relations, etc.), i.e., a system that absorbs more energy than it emits [1], can be represented mathematically by a symmetric Herglotz function (or Positive Real (PR) function), see e.g., [2, 3, 4, 5, 6, 7]. The condition of passivity implies, among other things, that the system also has to be causal [8]. Furthermore, the integral representation formula for symmetric Herglotz functions leads to integral identities or sum rules [4, 6] that are useful to derive physical bounds in a variety of technical applications such as e.g., radar absorbers [9], passive metamaterials [10], high-impedance surfaces [11], antennas [12, 13], reflection coefficients [14], waveguides [15], and periodic structures [16], only to mention a few. The integral representation formula can also be utilized in a convex optimization setting to construct an optimal approximating passive realization of a desired target response [17, 18], which is typically given on a finite closed interval of the real (frequency) axis. Optimal realizations of passive metamaterials are typical examples, where it is e.g., desired to synthesize low-loss materials with negative refractive index over a frequency interval [10, 17, 18]. However, there exist many practically important systems that are causal, but not passive, and thus we introduce a new class of functions to model them.

As a motivation for the study of the new class of functions, we refer to the use of gain media which has been proposed to improve the light localization effects in plasmonics, with applications such as plasmon waveguides, extraordinary transmission, perfect lenses, artificial magnetism, negative refractive index, cloaking, tunneling, high directivity radiators, optical nanocircuits, nanowires etc., see e.g., [19, 20, 21, 22, 23, 24, 25] with references. Here, the use of gain media refers to the use of fluorescent dyes through optical pumping for which there exist explicit Lorentz type of resonance models in the standard laser literature, see e.g., [21, 26, 27, 25, 28] with references. Hence, laser pumping is a physical mechanism that allows for a linearized description of the medium in terms of a dielectric permittivity that can have a negative imaginary part over some frequency intervals. Naturally, it has been recognized that such models must satisfy causality and the associated Kramers-Kronig relations [29]. However, our purpose of employing the new class of functions in this context is to add the restrictions imposed by passivity outside the (non-passive) emitting frequency range for determination of an optimal realization of non-passive medium characterized by permittivity function. It is emphasized that the term realizability is employed here in the sense of realizability theory as in [30]. This means that a given system response is realizable, not as a physical system, but rather as a function possessing mathematically well defined properties of physical significance, such as causality and passivity [30], and also as in our case, having some a priori assumed regularity properties regarding its boundary values on the real (frequency) axis.

The boundary values of analytic functions representing causal systems are treated classically in ${\rm L}^{2}$ spaces as in Titchmarsh’s theorem [5, 29] or in the sense of tempered distributions as in [2, 31]. There are a few results concerned with approximation theory or interpolation problems associated with partial information on the real axis (or on the unit circle). For example, bounds on the dispersion for finite-frequency-range Kramers-Kronig relations based on Stieltjes functions are presented in [32], and in [33], an approximation theory is given with density results for Hardy space approximants targeted for ${\rm L}^{p}$ functions defined on subsets of the circle. Furthermore, a related bounded extremal problem is examined in [34] with point-wise constraints on the complementary part of the circle.

In this paper, we are interested in extending the class of admittance passive systems to include certain causal, non-passive systems. This extension is aimed to preserve the integral representation formula for the system, as well as, in certain cases, a sum rule. As a characterization of the full class of functions that satisfy the sum-rule identities seems out of reach, we use a class of functions that includes all Herglotz functions, and for which the sum-rule identities still hold under some appropriate additional assumptions regarding their asymptotic expansions. Moreover, it is also desirable that the new class of functions can be incorporated in an approximation theory, similar to the one for Herglotz functions [18]. It turns out that differences of Herglotz functions are suitable in this sense and we define the (real) vector space generated by Herglotz functions as the space of quasi-Herglotz functions. As for the approximation theory, we follow a slightly different route than [33, 34] and consider, as approximants, certain subspaces of quasi-Herglotz functions which are Hölder continuously extendable to a neighborhood of a given approximation interval on the real line, equipped with the topology from a larger ${\rm L}^{p}$ space. This is a formulation that will imply that even smaller subspaces generated by finite B-spline expansions (useful in convex optimization) will be dense in the larger set of approximants. Numerical examples are included to demonstrate the approximation approach by modeling of a given non-passive system by solving a convex optimization problem. Here the generating measure is modeled by using a finite expansion of B-splines and point masses.

The rest of the paper is organized as follows: In Section 2, we introduce the set of quasi-Herglotz functions and discuss their basic properties, integral representations and boundary values. In Section 3, the sum rules are formulated and proved. In Section 4, the mathematical approximation theory and related convex optimization is formulated. It is based on certain assumptions regarding the Hölder continuity of the approximating quasi-Herglotz functions extended to the real line. In Section 5, the theory is illustrated by numerical examples, and the paper ends with conclusions in Section 6.

2 Quasi-Herglotz functions

2.1 Background

An important function class in applied mathematics is the class of so-called Herglotz functions. Known also under a variety of different names, such as Herglotz-Nevanlinna functions, Pick functions and R-functions, these are analytic functions on the upper half-plane

[TABLE]

having non-negative imaginary part [4, 3]. A major significance of this class of functions lies in the fact that the subclass of all symmetric Herglotz functions, i.e., Herglotz functions with the property

[TABLE]

is closely connected with passive systems [2]. Above, the superscript $(\>\cdot\>)^{*}$ denotes complex conjugation.

One of the most powerful tools in the theory of Herglotz functions is the existence of an integral representation formula [4, 3]. This well-known formula states that a function $h\colon\mathbb{C}^{+}\to\mathbb{C}$ is a Herglotz function if and only if it can be written, for any $z\in\mathbb{C}^{+}$ , as

[TABLE]

where $a_{+}\in\mathbb{R}$ , $b_{+}\geq 0$ and $\sigma_{+}$ is a finite positive Borel measure on $\mathbb{R}$ , and the subscrpit $(\>\cdot\>)_{+}$ is used to highlight the fact that these parameters represent a function with non-negative imaginary part. Furthermore, the correspondence between the function $h$ and the triple of its representing parameters $(a_{+},b_{+},\sigma_{+})$ is unique.

If we are, instead, considering a symmetric Herglotz function, condition (2) implies first that the function must take purely imaginary values along the imaginary axis, yielding that the coefficient $a_{+}$ from representation (3) must be zero. Furthermore, the Stieltjes inversion formula [3] implies that the measure $\sigma_{+}$ from representation (3) must be even, i.e., $\sigma_{+}(U)=\sigma_{+}(-U)$ for any Borel measurable set $U\subseteq\mathbb{R}$ , where $-U:=\{x\in\mathbb{R}\leavevmode\nobreak\ |\leavevmode\nobreak\ -x\in U\}$ .

As such, all symmetric Herglotz functions $h$ admit, for $z\in\mathbb{C}^{+}$ , an integral representation of the form

[TABLE]

where $b_{+}$ and $\sigma_{+}$ are as in representation (3), with the additional constraint that the measure $\sigma_{+}$ is symmetric, cf. [2, 3, 4, 5, 7] and $\mathrm{p.v.}$ denotes that the integral in representation (4) taken as the Cauchy principal value at infinity. Observe that it is necessary to view the above integral in the principal value sense to ensure convergence. Indeed, for any fixed $z\in\mathbb{C}^{+}$ , the integrand grows linearly at $\pm\infty$ and is, hence, not-necessarily integrable with respect to the measure $\sigma_{+}$ . Note, furthermore, that this is not the case in representation (3), where the integrand is a bounded function on $\mathbb{R}$ for any fixed $z\in\mathbb{C}^{+}$ .

2.2 Basic properties

We now introduce the following class of analytic functions on the upper half-plane.

Definition 2.1

An analytic function $q\colon\mathbb{C}^{+}\to\mathbb{C}$ is called a quasi-Herglotz function if there exist two Herglotz functions $h_{1}$ and $h_{2}$ , such that

[TABLE]

for any $z\in\mathbb{C}^{+}$ . Analogously, an analytic function $q\colon\mathbb{C}^{+}\to\mathbb{C}$ is called a symmetric quasi-Herglotz function if there exist two symmetric Herglotz functions $h_{1}$ and $h_{2}$ , such that equality (5) holds for all $z\in\mathbb{C}^{+}$ . The set of all quasi-Herglotz functions is denoted by $\mathcal{Q}$ , while the set of all symmetric quasi-Herglotz functions is denoted by $\mathcal{Q}_{\mathrm{sym}}$ .

We mention two trivial observations. First, any Herglotz (resp. symmetric Herglotz) function is also a quasi-Herglotz (resp. symmetric quasi-Herglotz) function, as we only need to take the function $h_{2}$ in Definition 2.1 to be identically equal to zero. Second, there is an element of non-uniqueness in Definition 2.1. If an analytic function $q$ can be written as in formula (5) for some Herglotz (resp. symmetric Herglotz) functions $h_{1}$ and $h_{2}$ , then it can also be written as

[TABLE]

where $z\in\mathbb{C}^{+}$ , for any other Herglotz (resp. symmetric Herglotz) function $h_{3}$ .

2.3 Integral representations

It is an immediate consequence of the integral representation formulas (3) and (4) that quasi-Herglotz functions in the sets $\mathcal{Q}$ and $\mathcal{Q}_{\mathrm{sym}}$ admit similar integral representations. Any function $q\in\mathcal{Q}$ can be written, for $z\in\mathbb{C}^{+}$ , as

[TABLE]

where $a$ and $b$ are real numbers and $\sigma$ is a signed Borel measure. In particular, if $q$ is given as $q=h_{1}-h_{2}$ , then $a=a_{+,1}-a_{+,2}$ , $b=b_{+,1}-b_{+,2}$ and $\sigma=\sigma_{+,1}-\sigma_{+,2}$ where, for $j=1,2$ , the parameters $a_{+,j},b_{+,j}$ and $\sigma_{+,j}$ are the representing parameter for the Herglotz function $h_{j}$ in the sense of representation (3).

Similarly, any function $q=h_{1}-h_{2}$ in the class $\mathcal{Q}_{\mathrm{sym}}$ can be written, for $z\in\mathbb{C}^{+}$ , as

[TABLE]

where $b$ and $\sigma$ are as in the previous case.

Note that, despite the element of non-uniqueness in Definition 2.1 discussed in Section 2.2, the triple of representing parameters $(a,b,\sigma)$ corresponding to a quasi-Herglotz function $q$ in the sense of representation (7) is determined uniquely by the function $q$ .

The integral representation formula (3) for ordinary Herglotz functions may also be written in terms of a not necessarily finite measure $\beta_{+}$ . Indeed, one can show that the right-hand side of representation (3) may equivalently be written as

[TABLE]

where $a_{+}$ and $b_{+}$ are as before and $\beta_{+}$ is a positive Borel measure on $\mathbb{R}$ satisfying the growth condition

[TABLE]

However, an integral representation of this form cannot yield an integral representation for all quasi-Herglotz functions, as the difference of two measures satisfying the growth condition (10) is not necessarily well-defined. Nevertheless, some quasi-Herglotz functions $q$ do admit an integral representation of the form

[TABLE]

and one case where this happens, which will appear later in Sections 4 and 5, is when the measure $\sigma$ from representation (7) has compact support. Then, the measure $\beta$ in representation (11) may be defined via $\mathrm{d}\beta(\xi)=(1+\xi^{2})\mathrm{d}\sigma(\xi)$ .

2.4 Boundary values

In general, Herglotz functions, as well as quasi-Herglotz functions, and in particular their imaginary parts, have boundary values (on the real line) only in the distributional sense, see e.g., [2, 6, 18, 31]. In what follows, however, we will be interested in complex-valued functions on some interval $\Omega\subset\mathbb{R}$ which appear as continuous extensions of suitable quasi-Herglotz functions.

First, we want to mention certain inclusions of function spaces, which will be very useful in Section 4. As usual, we let $C(\Omega)$ denote the Banach space consisting of all complex-valued continuous functions defined on some compact interval $\Omega\subset\mathbb{R}$ equipped with the standard max-norm $\|\cdot\|_{\infty}$ . The Hölder space with exponent $0<\alpha<1$ is denoted $C^{0,\alpha}(\Omega)$ and the corresponding norm is denoted $\|\cdot\|_{\alpha}$ , cf. [35, pp. 94-104]. Further, let $\mathrm{L}^{p}(w,\Omega)$ denote the Banach space with norm $\|f\|_{\mathrm{L}^{p}(w,\Omega)}=\left(\int_{\Omega}w(x)|f(x)|^{p}\mathrm{d}x\right)^{1/p}$ , where $1\leq p<\infty$ and $w>0$ denotes a positive continuous weight function on $\Omega$ , cf. [36]. The Banach space $\mathrm{L}^{\infty}(w,\Omega)$ is similarly equipped with the norm $\|f\|_{\mathrm{L}^{\infty}(w,\Omega)}$ defined by taking the essential supremum [36] of the function $w|f|$ . Then, the spaces defined above satisfy the following inclusions

[TABLE]

where $0<\alpha<1$ and $1\leq p\leq\infty$ .

Second, recall that the property that assures the existence of boundary values of quasi-Herglotz functions (i.e., of both the real and imaginary parts) is Hölder continuity of the density of the measure. More precisely, the following theorem holds, see e.g., [18, Thm. 2.2] for the argument.

Theorem 2.2

Let $q$ be a quasi-Herglotz function with representing parameters $(a,b,\sigma)$ and let $\Omega\subset\mathbb{R}$ be a compact interval. Then the function $q$ can be Hölder continuously (with Hölder exponent $\alpha$ ) extended to $\Omega\cup\mathbb{C}^{+}$ if and only if the measure $\sigma$ is absolutely continuous on the closure of some open neighborhood $\mathcal{O}$ of $\Omega$ and the corresponding restriction $\sigma|_{\overline{\mathcal{O}}}$ has a Hölder continuous density $\sigma^{\prime}_{\Omega}$ (with Hölder exponent $\alpha$ ), i.e., it belongs to the space $C^{0,\alpha}(\overline{\mathcal{O}})$ . In this case, for every $x\in\Omega$ , this extension is given by

[TABLE]

where the integral is taken as a Cauchy principal value both at infinity and at the singularity $x\in\mathbb{R}$ .

3 Sum rules

One of the most important properties of Herglotz functions are the, so-called, sum-rule identities [6, Thm. 4.1] and [4]. These identities relate weighted integrals of the imaginary part of a Herglotz function, via the moments of its representing measure, to the coefficients of the asymptotic expansion of the function at the points zero and infinity.

The asymptotic expansions we are interested in are always taken with respect to non-tangential limits in a Stoltz domain. A Stoltz domain with parameter $\theta\in(0,\frac{\pi}{2}]$ is the angular domain

[TABLE]

As such, the limit $z\hat{\to}0$ (resp. $z\hat{\to}\infty$ ) denotes that the limit $|z|\to 0$ (resp. $|z|\to\infty$ ) is taken in any Stoltz domain as above.

Consider now the following definitions.

Definition 3.1

Let $q$ be a quasi-Herglotz function. We say that $q$ admits, at $z=0$ , an asymptotic expansion of order $M\geq-1$ if there exist real numbers $a_{-1},a_{0},a_{1},\ldots,a_{M}$ such that $q$ can be written as

[TABLE]

Definition 3.2

Let $q$ be a quasi-Herglotz function. We say that $q$ admits, at $z=\infty$ , an asymptotic expansion of order $K\geq-1$ if there exist real numbers $b_{1},b_{0},b_{-1},\ldots,b_{-K}$ such that $q$ can be written as

[TABLE]

At $z=0$ , an expansion of order $M=-1$ always exists for any quasi-Herglotz function $q$ , as it always exists for any two Herglotz functions $h_{1}$ and $h_{2}$ , cf. [3, 6], yielding that

[TABLE]

where the signed measure $\sigma$ is as in representation (7). Similarly, at $z=\infty$ , an expansion of order $K=-1$ always exists for any quasi-Herglotz function $q$ , as it always exists for any two Herglotz functions $h_{1}$ and $h_{2}$ , cf. [3, 6], yielding that

[TABLE]

where the number $b$ is as in representation (7). Furthermore, the number $b$ equals the number $b_{1}$ appearing in Definition 3.2.

We may now derive the following sum-rule theorem.

Theorem 3.3

The following two statements hold.

(i)

Let $q=h_{1}-h_{2}$ be a quasi-Herglotz function, such that at least one of the Herglotz functions $h_{1}$ and $h_{2}$ admits, at $z=0$ , an asymptotic expansion (15) of some order $M\geq-1$ . Then, for some integer $N_{0}\geq 1$ with $2N_{0}-1\leq M$ , the limit

[TABLE]

exists as a finite number if and only if the function $q$ admits, at $z=0$ , an asymptotic expansion (15) of order $2N_{0}-1$ .

(ii)

Let $q=h_{1}-h_{2}$ be a quasi-Herglotz function, such that at least one of the Herglotz functions $h_{1}$ and $h_{2}$ admits, at $z=\infty$ , an asymptotic expansion (16) of some order $K\geq-1$ . Then, for some integer $N_{\infty}\geq 0$ with $2N_{\infty}+1\leq K$ , the limit

[TABLE]

exists as a finite number if and only if the function $q$ admits, at $z=\infty$ , an asymptotic expansion (16) of order $2N_{\infty}+1$ .

Furthermore, the identities

[TABLE]

are valid

•

for $k=-2N_{0},-2N_{0}+1,\ldots,-2$ if there exists an integer $N_{0}$ satisfying statement (i),

•

for $k=0,1,\ldots,2N_{\infty}$ if there exists an integer $N_{\infty}$ satisfying statement (ii),

•

for $k=-1$ if there exist integers $N_{0}$ and $N_{\infty}$ satisfying statements (i) and (ii), respectively.

In formula (21), the numbers $a_{-1},a_{0},a_{1},\ldots,a_{2N_{0}-1}$ are as in Definition 3.1 and the numbers $b_{-1},b_{-2},\ldots,b_{-(2N_{\infty}+1)}$ are as in Definition 3.2.

Proof

In the case of statement (i), we may, without loss of generality, assume that, if we write $q=h_{1}-h_{2}$ , it is the function $h_{2}$ that admits, at $z=0$ , an asymptotic expansion (15) of some order $M\geq-1$ .

Then, it follows from e.g., [6, Thm. 4.1], that the limit (19) for the function $h_{2}$ exists and, moreover, that the sum rules identities (21) hold for the function $h_{2}$ for all $k$ between $-M-1$ and $-2$ . Thus, the existence of the limit (19) for the function $q=h_{1}-h_{2}$ is equivalent to the existence of the limit

[TABLE]

and the existence of an asymptotic expansion of the function $q$ of the form (15) is equivalent to the existence of an analogous expansion of the function $h_{1}$ . Statement (i) is then established by applying the sum-rule for the function $h_{1}$ , namely [6, Thm. 4.1].

The proof of statement (ii) follows an analogous reasoning. $\square$

Remark 3.4

For $q=h_{1}-h_{2}$ , the requirement of statement (i) in Theorem 3.3 will certainly be satisfied if the representing measure of at least one of the functions $h_{1}$ or $h_{2}$ has support that does not include the point zero. Similarly, the requirement of statement (ii) in Theorem 3.3 will certainly be satisfied if the representing measure of at least one of the functions $h_{1}$ or $h_{2}$ has compact support.

Remark 3.5

If, in Theorem 3.3, we have a function $q\in\mathcal{Q}_{\mathrm{sym}}$ , all integrals with odd powers $k$ on the left-hand side of identity (21) are zero due to the symmetry of the measure. Furthermore, for even powers $k$ , these integrals may be written as

[TABLE]

Remark 3.6

Theorem 3.3 cannot be formulated for arbitrary quasi-Herglotz functions. Examples show that it even does not hold for all meromorphic quasi-Herglotz functions, e.g., it can be shown that the quasi-Herglotz function $q(z):=\tan(z)-\mathrm{i}$ admits, at $z=\infty$ , an asymptotic expansion of order $K=0$ , but there exists no integer $N_{\infty}$ that would fulfill statement (ii) of Theorem 3.3.

4 Approximation and optimization based on quasi-Herglotz functions

In this section, we derive the rationale for employing convex optimization as a tool to approximate a given continuous function defined on a compact approximation domain by certain quasi-Herglotz functions. The approximating quasi-Herglotz functions are first restricted to a certain subspace characterized by a particular requirement regarding their Hölder continuity on the approximation domain. Then it is shown that a smaller set of quasi-Herglotz functions generated by finite B-spline expansions (suitable for convex optimization) is dense in the larger space of Hölder continuous quasi-Herglotz functions in the topology induced by any $\mathrm{L}^{p}$ -norm. In essence, this development constitutes a straightforward, but very important extension of previous results derived for Herglotz functions [18].

4.1 Approximation theory based on quasi-Herglotz functions

To make the statements given above precise, we fix the approximation domain $\Omega\subset\mathbb{R}$ as a finite union of closed and bounded intervals on the real axis.

With a finite B-spline expansion we refer to a finite linear combination of B-splines of any order $m\geq 2$ . A B-spline of order $m$ is a compactly supported positive basis spline function which is piecewise polynomial of order $m-1$ , i.e., linear, quadratic, cubic, etc., and which is defined by $m+1$ break-points as described in e.g., [37, 38]. With a finite uniform B-spline expansion we refer to a finite B-spline expansion with equidistant break-points.

The following definitions and theorems are similar as in [18] but extended to the current situation with quasi-Herglotz functions. Let $\Omega$ be given as above and let $w>0$ denote a positive continuous weight function, and let $0<\alpha<1$ , $1\leq p\leq\infty$ and $m\geq 2$ .

Definition 4.1

Let $W^{\alpha,p}(w,\Omega)\subset\mathrm{L}^{p}(w,\Omega)$ denote the subspace of all complex-valued functions $q\in C^{0,\alpha}(\Omega)$ with the following property: There exists a quasi-Herglotz function that has a Hölder continuous (with exponent $\alpha$ ) extension to the closure of some neighboorhood ${\cal O}$ of $\Omega$ which coincides with $q$ on $\Omega$ .

Note that we consider $W^{\alpha,p}(w,\Omega)$ as a subspace of $\mathrm{L}^{p}(w,\Omega)$ and hence equipped with the topology from $\mathrm{L}^{p}(w,\Omega)$ .

Remark 4.2

If it is clear from the context, in the following, we denote by $q$ the quasi-Herglotz function as well as the extension to $\mathbb{C}^{+}\cup{\cal O}$ and its restriction to $\Omega$ .

Definition 4.3

Let $W^{m,p}(w,\Omega)\subset W^{\alpha,p}(w,\Omega)$ denote the subspace of those functions for which the signed measure $\beta$ (in (11)) of the quasi-Herglotz function $q$ in Definition 4.1 is absolutely continuous with density $\beta^{\prime}$ that is a finite uniform B-spline expansion of order $m$ .

Note that the sets $W^{\alpha,p}(w,\Omega)$ and $W^{m,p}(w,\Omega)$ are independent of $p$ and $w$ , but are equipped with the topology of $\mathrm{L}^{p}(w,\Omega)$ .

Remark 4.4

The signed measure $\beta$ is a not necessarly finite signed Borel measure and can be represented in terms of the finite signed measure $\sigma$ as described in Section 2.3.

The following Theorem is a straightforward generalization of [18, Thm. 3.2] to the situation of quasi-Herglotz functions instead of Herglotz functions.

Theorem 4.5

The subspace $W^{m,p}(w,\Omega)$ is dense in $W^{\alpha,p}(w,\Omega)$ with respect to the topology of $\mathrm{L}^{p}(w,\Omega)$ .

Proof

Let $\varepsilon>0$ and let a function $q\in W^{\alpha,p}(w,\Omega)$ be given. Since both the positive and the negative part of a real valued Hölder continuous function are again Hölder continuous, it follows that $q$ can be written as $q=h_{1}-h_{2}$ with functions $h_{1}$ and $h_{2}$ belonging to the convex cone $V^{\alpha,p}(w,\Omega)$ , similar to $W^{\alpha,p}(w,\Omega)$ but generated by extensions of Herglotz functions rather than quasi-Herglotz functions. Then, Theorem 3.2 in [18, pp. 11-14] implies that there exist functions $\widetilde{h}_{1}$ and $\widetilde{h}_{2}$ belonging to the convex cone $W^{m,p}(w,\Omega)$ such that $\|\widetilde{h}_{i}-h_{i}\|_{\mathrm{L}^{p}(w,\Omega)}<\frac{\varepsilon}{2}$ for $i=1,2$ . Hence for $\widetilde{q}:=\widetilde{h}_{1}-\widetilde{h}_{2}\in W^{m,p}(w,\Omega)$ it holds $\|\widetilde{q}-q\|_{\mathrm{L}^{p}(w,\Omega)}<\varepsilon$ , which finishes the proof. $\square$

Definition 4.6

Let $F\in C(\Omega)$ and consider the problem to approximate $F$ based on the set of functions $q\in W^{\alpha,p}(w,\Omega)$ . The greatest lower bound on the approximation error over the subspace $W^{\alpha,p}(w,\Omega)$ is defined by

[TABLE]

Note that the distance $d$ depends on the chosen topology of $\mathrm{L}^{p}(w,\Omega)$ , but is independent of the Hölder exponent $\alpha$ , cf. [18]. The following theorem demonstrates the usefulness of employing finite B-spline expansions in the associated approximation problem.

Theorem 4.7

The greatest lower bound on the approximation error defined in (24) is given by

[TABLE]

The theorem is a straightforward consequence of Theorem 4.5 together with an application of the triangle inequality. It is noted that the distance $d$ is independent of the Hölder exponent $\alpha$ as well as of the spline order $m$ , cf. [18]. The following obvious corollary can be used when the measure of the approximating quasi-Herglotz function contains a set of point masses. Such cases will be discussed in Sections 4.2 and 5.

Corollary 4.8

Let $W(w,\Omega)\subset W^{\alpha,p}(w,\Omega)$ be a set which contains $W^{m,p}(w,\Omega)$ . Then, $W(w,\Omega)$ is dense in $W^{\alpha,p}(w,\Omega)$ and it holds that

[TABLE]

4.2 Convex optimization with B-splines

The significance of the Theorems 4.5 and 4.7 is that B-spline expansions [37, 39, 38], which are well suited for numerical optimization [18, 40, 41], can be used to approximate a given continuous function $F$ with arbitrary small deviation from the greatest lower bound defined in (24). A detailed description of the associated convex optimization problem is given as follows.

Let the approximation domain $\Omega$ , the target function $F\in C(\Omega)$ and the weight function $w\in C(\Omega)$ be given as above, and let $0<\alpha<1$ , $1\leq p\leq\infty$ and $m\geq 2$ . As approximating functions we can use functions $q$ from a set $W(w,\Omega)$ defined by the following representations

[TABLE]

for $x\in\Omega$ , and where the second part of the integral in (27) has been absorbed into the constant $\check{a}$ in (28). In (27) and (28) the density $\beta^{\prime}$ is a finite uniform B-spline expansion as in Definition 4.3, and a finite number of point masses at $\xi_{i}\notin\Omega$ with real-valued amplitudes $p_{i}$ , $i=1,\ldots,M$ , have also been included. It is noted that the set $W(w,\Omega)$ satisfies the condition $W^{m,p}(w,\Omega)\subset W(w,\Omega)\subset W^{\alpha,p}(w,\Omega)$ of Corollary 4.8.

In particular, we employ here B-spline basis functions $p_{n}(x)$ of fixed polynomial order $m-1$ for $n=1,\ldots,N$ , where $N$ is the number of B-splines, and $\hat{p}_{n}(x)$ the (negative) Hilbert transform [29] of the B-spline functions. Explicit formulas for general uniform as well as non-uniform B-splines and their Hilbert transforms are given in [42, Sec. 3.1]. Let $q_{N}\in W$ denote approximating functions represented as in (28), and hence

[TABLE]

and

[TABLE]

for $x\in\Omega$ , and where $c_{n}$ are the corresponding B-spline expansion coefficients. Note that all the parameters $\check{a}$ , $b$ , $\{p_{i}\}_{i=1}^{M}$ and $\{c_{n}\}_{n=1}^{N}$ , as well as the break-points of the B-splines defined above depend on $N$ . It is further assumed that the support of $q_{N}$ grows with $N$ at the same time as the distance $\delta$ between breakpoints decreases, e.g., as $|\textrm{supp}\{q_{N}\}|=\sqrt{N}$ and $\delta=|\textrm{supp}\{q_{N}\}|/N=1/\sqrt{N}$ . For a fixed $N$ , the minimization of the norm of the approximation error $\|q_{N}-F\|_{\mathrm{L}^{p}(w,\Omega)}$ is a finite-dimensional convex optimization problem over the real parameters $\check{a}$ , $b$ , $p_{i}$ and $c_{n}$ , and we denote the optimal value $d_{N}$ . The important implication of Theorem 4.7 and Corollary 4.8 is that $d_{N}\rightarrow d$ as $N\rightarrow\infty$ . Finally, it is noted that for a numerical implementation using e.g., the CVX MATLAB software for disciplined convex programming [41] the calculation of the norm above must be approximated based on a finite set of sample points in $\Omega$ . However, due to the uniform continuity of all functions involved, this can, in principle, be done within arbitrary numerical accuracy.

Now that we have established the rationale for using numerical convex optimization as a tool for approximating a given continuous function based on the set of quasi-Herglotz functions, we can also expand the setting by incorporating any additional convex constraints of interest, see also [17, 43, 44]. For example, we can include upper and lower bounds on the density $\beta^{\prime}$ stated as

[TABLE]

where the optimization is over $q\in W(w,\Omega)$ and $b_{\mathrm{lower}}$ and $b_{\mathrm{upper}}$ are suitable functions. Note that these functions can be used for constraining the density $\beta^{\prime}(x)$ outside of $\Omega$ to prevent non-physical oscillatory behavior of the resulting function outside of the approximation domain. Also, these constraints are useful in regularization of the low-frequency behavior of materials, see the numerical examples in Section 5. In practice, this might for instance amount to solving for $q_{N}\in W(w,\Omega)$

[TABLE]

where $N$ is fixed, $J$ is a finite index set, and the vector $\theta$ may consist of any of the parameters $\theta_{j}\in\{\check{a},b,\leavevmode\nobreak\ p_{1},\ldots,p_{M},c_{1},\ldots,c_{N}\}$ , for $j\in J$ .

When a priori information is available about the asymptotic properties of a given non-passive system to be approximated and which admits the sum rules discussed in Section 3, the identities (21) can be involved in an optimization (31) as an additional convex constraint. Due to the finite-dimensional approximation (29), the left-hand side of (21) becomes

[TABLE]

for even $k=-2N_{0},\ldots,2N_{\infty}$ , see Theorem 3.3, and which can be employed as an additional constraint in the optimization formulation (32).

5 Numerical examples

In the numerical examples presented below, non-passive approximation is employed as a tool to determine optimal realizations (in the sense of a mathematical representation) of non-passive systems with a given target response over the approximation domain. The target functions to be approximated are symmetric, and thus, we employ symmetric quasi-Herglotz functions to solve the convex optimization problems.

The symmetry property (2) implies that the representation based on (29) and (30) can be simplified as:

[TABLE]

and

[TABLE]

respectively, where $\check{a}=0$ and $p_{0}$ denotes the amplitude of the point mass located at [math]. In connection with the optimization formulation (31) and (32) established above, it is also convenient here to introduce the notation $\Omega_{\mathrm{opt}}=I_{1}\cup I_{2}$ , where $\Omega_{\mathrm{opt}}$ is the optimization domain consisting of two disjoint sets $I_{1}$ and $I_{2}$ where the approximating measure is required to be non-negative ( $p_{i}\geq 0$ or $c_{n}\geq 0$ ) and non-positive ( $p_{i}\leq 0$ or $c_{n}\leq 0$ ), respectively. Hence, the support of the measure is contained in $\Omega_{\mathrm{opt}}$ .

The approximation methods that we have described above are general, and can be applied to any quasi-Herglotz function and for a range of physics and engineering applications. In the examples below, we consider a sequence of interesting different optimization constraints that we think are generally applicable. In particular, we consider different constraints on the optimized function outside the approximation domain, $\Omega$ , where passivity or conditions of non-passivity can be applied. In the first numerical example presented in Section 5.1, we consider a target response, $F$ , which is the restriction of a non-Herglotz function, here $F=-h_{0}$ , where $h_{0}$ is the $\Omega$ restriction of a Herglotz function. In the second example described in Section 5.2, we use the same target response, however, we apply the non-passive approximation framework to determine an optimal realization of the system. Here, we consider a constrained amplifying region over a fixed bandwidth outside the approximation domain. In addition, we study the dependence of the approximation error on the size of the approximation domain for a given fixed optimization domain. The third example in Section 5.3 is focused on the non-passive approximation of a given system, where the approximating quasi-Herglotz function is generated by a measure consisting of point masses. In the fourth numerical example given in Section 5.4, we are interested to determine an optimal non-passive realization of the given system with additional constraints of its asymptotic properties, i.e., the behavior in the small- and large-argument limits. Hereby, we extend the convex optimization problem (32) with an additional sum-rule constraint which is based on (33).

Although the developed theory is generally applicable, in the following examples, we select a canonical electromagnetic application, which is the modeling of permittivity functions that characterize metamaterials with desired exotic properties (fixed negative permittivity, which is of interest in e.g., plasmonic applications). For the non-passive cases, the optimized dielectric permittivity functions $\epsilon_{\mathrm{opt}}$ can have negative imaginary part over some frequency intervals. Note that the functions we study here correspond to linear and stable systems. Consequently, these functions have no poles in the upper-half complex plane. However, these functions can also be used as an input to another system, e.g., the transmission or reflection coefficients [45] from a dielectric slab, and cause an instability of the resulting system. In practice, any instability issues associated with non-passive systems will always be limited by other external factors such as the saturation of the gain media [21], which is not considered in this paper.

Here, the independent variable is a dimensionless real-valued normalized frequency $x$ corresponding to an angular frequency $\omega$ in $\mathrm{\,rad/s}$ . For simplicity and since the approximants $q(x)$ and $q_{N}(x)$ in (31) and (32) are conjugate symmetric on $\mathbb{R}$ , in particular, $\mathrm{Im}\{q_{N}\}$ is an even function when restricted to $\mathbb{R}$ , we will only specify and visualize the right side of the approximation domain, i.e., $\Omega\cap\mathbb{R}_{+}$ .

5.1 Passive approximation of a system with a given target response

An interesting canonical example for which (24) gives a non-trivial bound is with the passive approximation of a negative symmetric Herglotz function $F=-h_{0}$ , which can be Hölder continuously extended to $\mathbb{C}^{+}\cup\Omega$ , and which has the large-argument asymptotics $h_{0}(z)=b_{1}^{0}z+o(z)$ as $z\hat{\to}\infty$ . Based on the theory of Herglotz functions and associated sum rules [6], it can be shown that

[TABLE]

for all Herglotz functions $h$ with large-argument asymptotics $h(z)=b_{1}z+o(z)$ as $z\hat{\to}\infty$ , see [10, 18]. Here, $|\Omega|$ is the length of the interval $\Omega$ .

As an application, consider a passive approximation of a metamaterial as in [10], and note that the case with passive systems and passive approximation based on Herglotz functions as in [18] constitutes a special case of the non-passive approximation based on the representations (34) and (35), with $b_{\mathrm{lower}}=0$ in (31), i.e., $\beta^{\prime}(x)\geq 0$ for all $x$ .

For a passive metamaterial, a dielectric permittivity function $\epsilon(z)$ is considered, where $h(z)=z\epsilon(z)$ is the associated symmetric Herglotz function [10]. The high-frequency permittivity of the metamaterial is assumed to be given by $\epsilon_{\infty}$ , and hence $b_{1}=\epsilon_{\infty}$ . A real-valued and constant target permittivity $\epsilon_{\rm t}<0$ is given over the approximation interval $\Omega$ defined by $\Omega=[1-B/2,1+B/2]$ , where $B$ is the relative bandwidth, $0<B<2$ , and hence $F(x)=x\epsilon_{\rm t}$ with $h_{0}(z)=-z\epsilon_{\rm t}$ , and $b_{1}^{0}=-\epsilon_{\rm t}$ . Now, by using (36) the resulting physical bound can thus be obtained as

[TABLE]

see also [10, 18]. Note that here $\|\epsilon-\epsilon_{\rm t}\|_{\infty}=\|h-F\|_{\mathrm{L}^{\infty}(w,\Omega)}$ , where the weight function is $w(x)=1/x$ for $x\in\Omega$ , with $0\notin\Omega$ , and where $\Delta$ is the resulting physical bound.

Let the target function $F=x\epsilon_{\mathrm{t}}$ be defined over the approximation domain $\Omega=[1-B/2,1+B/2]$ , where the relative bandwidth $B=0.02$ , and let the support of the generating measure $\mathrm{supp}\{\beta\}\cap\mathbb{R}_{+}$ be contained in the optimization domain $\Omega_{\mathrm{opt}}=\{0\}\cup[0.97,1.03]$ including one positive point mass with amplitude $p_{0}$ at the origin, and the density $\beta^{\prime}$ is constrained to be non-negative, i.e., $\beta^{\prime}(x)\geq 0$ .

Figure 1 shows the result of optimization (32) carried out using $N=100$ uniform linear B-splines (of order $m=2$ ) for given parameters $\epsilon_{\mathrm{t}}=-1$ and $\epsilon_{\infty}=1$ . The real and imaginary parts of the resulting permittivity function $\epsilon_{\mathrm{opt}}$ are shown in Figures 1(a) and 1(b), respectively, including comparison with the fundamental sum-rule-bound limits $\epsilon_{\mathrm{t}}\pm\Delta$ , when $\Delta$ is given by (37). Interestingly, the optimized function is, in principal, supported only on $\Omega$ , i.e., the optimal solution for $\beta^{\prime}$ is approximately zero outside the approximation domain $\Omega$ , except for the point mass with $p_{0}\approx 79.1$ . It should be noted that the point mass at the origin contributes with a response at frequencies $x\neq 0$ , which is very similar to that of a Drude model with sufficiently large relaxation time $\tau$ , so that $x\tau\gg 1$ .

5.2 Non-passive approximation of a system with a given target response

Consider an approximation problem, similar to one described in Section 5.1, with the difference that the approximating functions are not restricted to Herglotz functions, but to quasi-Herglotz functions. Let $\Omega_{\mathrm{opt}}=I_{1}\cup I_{2}$ denote the domain of optimization of the measure $\beta$ , where $I_{2}=[0.97,0.99)\cup(1.01,1.03]$ is the frequency set where the density $\beta^{\prime}$ is constrained to be non-positive, i.e., $\beta^{\prime}(x)\leq 0$ .

Figure 2 shows the corresponding optimization results using $I_{1}=\{0\}$ including only a point mass with amplitude $p_{0}$ at the origin. It has been observed that by letting $I_{1}=\{0\}\cup\Omega$ and letting $\beta^{\prime}(x)\geq 0$ over $\Omega$ , one can obtain negligible deviation from the optimal solution presented in Figure 2. In this optimization, we used $N=100$ linear B-splines, which is sufficient to achieve an approximation error in the order of magnitude $10^{-6}$ . Here, the resulting point mass has a magnitude $p_{0}\approx 79.1$ , which is essentially the same as in the example presented in Section 5.1, but the support of the approximating function $q_{N}$ over $I_{2}$ becomes concentrated to the outermost endpoints of the set. This seems to suggest that two point masses with amplitudes $p_{1}$ and $p_{2}$ located at the two outermost endpoints $x_{\mathrm{l}}\approx 0.971$ (lower endpoint) and $x_{\mathrm{u}}\approx 1.029$ (upper endpoint) of $I_{2}$ should be sufficient for the optimization, where the approximating function has the representation based on (34). Hence, Figure 2 also includes optimization results, where the measure $\beta$ consists solely of these two point masses with negative amplitudes $p_{1}$ and $p_{2}$ , together with the original point mass at [math] with amplitude $p_{0}$ . Figure 2(b) also shows the optimized point masses with $p_{1}\approx-8.7$ and $p_{2}\approx-8.48$ (indicated by the dark-red “o”) normalized to the same area as the corresponding linear B-spline basis functions. The example illustrates that it is possible under certain circumstances to obtain a much better realization of the target permittivity $\epsilon_{\mathrm{t}}$ given over the approximation interval with smaller approximation error as a non-passive system by using quasi-Herglotz functions rather than by using only Herglotz functions; compare the results in Figures 2(a) and 1(a), respectively.

Now, let $\Omega_{\mathrm{opt}}=I_{1}\cup I_{2}$ , where $I_{1}=\{0\}$ (where $p_{0}\geq 0$ ), and $I_{2}=[0.97,1-B/2)\cup(1+B/2,1.03]$ (where $\beta^{\prime}(x)\leq 0$ ). The approximation domain is $\Omega=[1-B/2,1+B/2]$ . Figure 3 illustrates how the size of the approximation domain $|\Omega|=B$ affects the optimal realization of the desired system response. Here, the support of the measure $q_{N}$ is concentrated at the outermost frequencies of the set $I_{2}$ as demonstrated in the example above. In Figure 3(a) is shown optimal realizations $\mathrm{Re}\{\epsilon_{\mathrm{opt}}\}$ of the desired target function $\epsilon_{\mathrm{t}}=-1$ for two different sizes of $\Omega$ , where $B=0.02$ and $B=0.056$ , respectively, and a comparison with the passivity bounds $\mathrm{Re}\{\epsilon_{\mathrm{t}}\}\pm\Delta$ defined in (37).

Figure 3(b) shows how the approximation error $\|\epsilon-\epsilon_{\mathrm{t}}\|_{\infty}$ depends on the size of the approximation domain given as a function of the relative bandwidth $B$ , i.e., $\Omega=[1-B/2,1+B/2]$ . From this figure it can be concluded that by increasing the size of $\Omega$ towards the outermost points of $I_{2}$ , the approximation error increases, and for high values of $B$ the optimization results become even worse than the results based on Herglotz functions for the passive case.

5.3 Optimization with point masses

Consider again the optimization based solely on point masses as in Section 5.2 above. For this problem, let $\Omega_{\mathrm{opt}}=I_{1}\cup I_{2}$ , where $I_{1}=\{0\}$ , and $I_{2}=\{x_{\mathrm{l}}\}\cup\{x_{\mathrm{u}}\}$ , denote the domain of optimization of the measure $\beta$ , and $\Omega=[1-B/2,1+B/2]$ the approximation domain. Here, the approximating quasi-Herglotz function has the representation based on (34) with $c_{n}=0$ , $x_{\mathrm{l}}$ and $x_{\mathrm{u}}$ are the lower and upper normalized frequencies, where the point masses with negative amplitudes are located. Note that the optimization is done solely over the three point masses with amplitudes $p_{0}$ (which is located at the origin), $p_{1}$ and $p_{2}$ .

Figure 4 shows the optimization results and how the approximation error $\|\epsilon-\epsilon_{\mathrm{t}}\|_{\infty}$ depends on the location of the assumed a priori point mass, where $\epsilon_{\mathrm{t}}=-1$ is the given target permittivity function. From Figure 4(b), it can be concluded that the non-passive approximation based on point masses provides a good agreement between the target function and the optimal solution based on the approximating quasi-Herglotz function generated by point masses, in particular, the approximation error has an L-curve characterization with transition at about $2\%$ from the normalized frequency $x_{\mathrm{c}}$ ; see the approximation error and the optimal realizations in Figures 4(b) and 4(a), respectively.

5.4 Optimization with sum-rule constraints

Now, we would like to further constrain the optimization (31) to determine an optimal realization of a system with additionally given small- and large-argument asymptotic properties. Consequently, for the given target function $F$ , the convex optimization problem (32) is modified with an additional convex constraint obtained from (33) for $k=-2$ as

[TABLE]

where $a_{1}$ and $b_{1}$ denote the expansion coefficients of the small- and the large-argument asymptotic expansion of the given system response, respectively. Note that $b_{1}$ in constraint (38) coincides with $b$ in the representation (28) of the approximating quasi-Herglotz function.

As an application, modeling of a permittivity function is considered here, where the target permittivity $\epsilon_{\mathrm{t}}=-1$ is fixed over the approximation domain $\Omega=[1-B/2,1+B/2]$ , $0<B<2$ . The small- and the large-argument asymptotics of this system are represented by the static and the high-frequency permittivities, i.e., $a_{1}=\epsilon_{\mathrm{s}}$ and $b_{1}=\epsilon_{\infty}$ , respectively.

Let $\Omega_{\mathrm{opt}}=I_{1}\cup I_{2}$ denote the domain of optimization of $\beta^{\prime}$ , $I_{1}=[0.01,0.9)\cup(1.5,2]$ the frequency set, where the density is restricted to be non-negative, i.e., $\beta^{\prime}(x)\geq 0$ , and $I_{2}=(1.1,1.5]$ the frequency set, where the density $\beta^{\prime}(x)\leq 0$ . For this optimization, the relative bandwidth $B=0.2$ (and hence $\Omega=[0.9,1.1]$ ), the asymptotic constraints are $a_{1}=\epsilon_{\mathrm{s}}=3$ and $b_{1}=\epsilon_{\infty}=1$ , and $N=1000$ linear B-splines are used due to the increased size of $\Omega_{\mathrm{opt}}$ , which is sufficient for an accurate solution. Further, the set $I_{1}$ has been increased in comparison with the previous example to control the realization of the optimal solution with the desired low-argument asymptotic behavior.

Figures 5(a)-(b) depict the corresponding optimization results with no a priori point masses. The obtained optimization result shows a good agreement of the target function $\epsilon_{\mathrm{t}}=-1$ over the approximation domain $\Omega$ ; see Figure 5(b). The approximation error $\|\epsilon-\epsilon_{\mathrm{t}}\|_{\infty}$ is much less than the physical bound for passive metamaterials (37), and hence, the optimal solution $\epsilon_{\mathrm{opt}}$ fits well $\epsilon_{\mathrm{t}}$ over $\Omega$ . Also, it is reassuring to note that the optimal solution satisfies the asymptotic requirement for the small-argument limit, i.e. $\epsilon_{\mathrm{opt}}\rightarrow 3$ as $x\rightarrow 0$ ; see Figure 5(a). We have observed that the same result can be accurately achieved when the measure $\beta$ consists of two point masses with amplitudes $p_{1}\approx 989.9$ and $p_{2}\approx-790.8$ placed at $x_{1}\approx 0.469$ and $x_{\mathrm{u}}\approx 1.499$ , respectively, where $x_{\mathrm{u}}$ is the upper outermost frequency of the non-passive region represented by the set $I_{2}$ . In Figure 5(d), the approximation error is shown as a function of the small-argument asymptotic constraint $a_{1}=\epsilon_{\mathrm{s}}$ . It is interesting to note that the approximation error decreases as $\epsilon_{\mathrm{s}}$ increases, and meanwhile, the location of the point mass with positive amplitude $p_{1}$ moves towards the origin. Hence, in the limit, when the point mass approaches zero, we obtain a result which is very similar to the non-passive approximation case described in Sections 5.2 and 5.3.

6 Conclusions

In this paper, the non-passive framework for a certain class of non-passive causal systems has been formulated. This has been done by extending the existing class of Herglotz functions to the class of quasi-Herglotz functions, which is obtained by taking all possible differences of two Herglotz functions. Based on the integral representation formulas for Herglotz functions using finite measures, we have shown that quasi-Herglotz functions can be described by an integral representation formula using signed Borel measures. For Herglotz functions, one can also use an equivalent, possibly non-finite measure, in their representation formula. However, this is not the case for quasi-Herglotz functions when the measure is non-finite where only some functions admit integral representations via non-finite signed measures. Quasi-Herglotz functions can also be analytically extended to some interval of the real axis in the same way as Herglotz functions, provided the density of measure of the function is Hölder continuous on some open neighborhood of this interval, which is important for the non-passive framework. Furthermore, we show that quasi-Herglotz functions admit, under certain additional constraints, sum-rule identities that generalize the known identities for Herglotz functions, and which allow us to control low- and high-argument asymptotics of desired non-passive systems in optimization problems.

We have also demonstrated that a family of B-splines can be used in the representation of approximating quasi-Herglotz functions, which is utilized in a number of numerical examples. It has been concluded that a very efficient mathematical representation of a non-passive metamaterial with $\epsilon_{\mathrm{t}}\approx-1$ (which is typical in plasmonic applications) can be achieved by choosing point masses representing the power excitation at certain frequencies outside of the approximation domain. A further constrained problem for non-passive metamaterials with controlled low- and high-frequency responses shows that the sum-rule identities can be efficiently utilized in the realization of such permittivities with desired properties as a constraint for the convex optimization problem (31).

This work was supported by the Swedish Foundation for Strategic Research (SSF) under the program Applied Mathematics and the project Complex analysis and convex optimization for EM design.

Data access

The paper has no experimental data. The numerical simulations were carried out using the open-source CVX MATLAB package [41]. All of the data needed to run the simulations is specified in the article.

References

[1]

Nedic M, Ehrenborg C, Ivanenko Y, Ludvig-Osipov A, Nordebo S, Luger A, Jonsson BLG, Sjöberg D, Gustafsson M. 2019 In Herglotz functions and applications in electromagnetics,. IET.

[2]

Zemanian AH. 1965 Distribution theory and transform analysis: an introduction to generalized functions, with applications. New York: McGraw-Hill.

[3]

Kac IS, Krein MG. 1974 R-functions - Analytic functions mapping the upper halfplane into itself. Am. Math. Soc. Transl. 103, 1–18.

[4]

Akhiezer NI. 1965 The classical moment problem. Oliver and Boyd.

[5]

Nussenzveig HM. 1972 Causality and dispersion relations. London: Academic Press.

[6]

Bernland A, Luger A, Gustafsson M. 2011 Sum rules and constraints on passive systems. Journal of Physics A: Mathematical and Theoretical 44, 145205.

[7]

Gesztesy F, Tsekanovskii E. 2000 On matrix-valued Herglotz functions. Math. Nachr. 218, 61–138.

[8]

Youla D, Castriota L, Carlin H. 1959 Bounded real scattering matrices and the foundations of linear passive network theory. IRE Transactions on Circuit Theory 6, 102–124.

[9]

Rozanov KN. 2000 Ultimate Thickness to Bandwidth Ratio of Radar Absorbers. IEEE Trans. Antennas Propagat. 48, 1230–1234.

[10]

Gustafsson M, Sjöberg D. 2010 Sum rules and physical bounds on passive metamaterials. New Journal of Physics 12, 043046.

[11]

Gustafsson M, Sjöberg D. 2011 Physical bounds and sum rules for high-impedance surfaces. IEEE Transactions on Antennas and Propagatation 59, 2196–2204.

[12]

Gustafsson M, Sohl C, Kristensson G. 2007 Physical limitations on antennas of arbitrary shape. Proc. R. Soc. A 463, 2589–2607.

[13]

Jonsson BLG, Kolitsidas CI, Hussain N. 2013 Array Antenna Limitations. Antennas and Wireless Propagation Letters, IEEE 12, 1539–1542.

[14]

Gustafsson M. 2010 Sum rules for lossless antennas. IET Microwaves, Antennas & Propagation 4, 501–511.

[15]

Vakili I, Gustafsson M, Sjöberg D, Seviour R, Nilsson M, Nordebo S. 2014 Sum Rules for Parallel-Plate Waveguides: Experimental Results and Theory. IEEE Transactions on Microwave Theory and Techniques 62, 2574–2582.

[16]

Gustafsson M, Vakili I, Keskin SEB, Sjöberg D, Larsson C. 2012 Optical theorem and forward scattering sum rule for periodic structures. IEEE Trans. Antennas Propagat. 60, 3818–3826.

[17]

Nordebo S, Gustafsson M, Nilsson B, Sjöberg D. 2014 Optimal realizations of passive structures. IEEE Trans. Antennas Propagat. 62, 4686–4694.

[18]

Ivanenko Y, Gustafsson M, Jonsson BLG, Luger A, Nilsson B, Nordebo S, Toft J. 2019 Passive approximation and optimization using B-splines. SIAM Journal on Applied Mathematics 79, 436–458.

[19]

Maier SA. 2007 Plasmonics: Fundamentals and Applications. Berlin: Springer-Verlag.

[20]

Capolino F. 2009 Metamaterials handbook: theory and phenomena of metamaterials. Boca Raton: CRC.

[21]

Lawandy NM. 2004 Localized surface plasmon singularities in amplifying media. Applied Physics Letter 85, 5040–5042.

[22]

Skaar J, Seip K. 2006 Bounds for the refractive indices of metamaterials. J. Phys. D: Appl. Phys. 39, 1226–1229.

[23]

Govyadinov AA, Podolskiy VA, Noginov M. 2007 Active metamaterials: Sign of refractive index and gain-assisted dispersion management. Applied Physics Letters 91, 191103.

[24]

Lind-Johansen Ø, Seip K, Skaar J. 2009 The perfect lens on a finite bandwidth. Journal of mathematical physics 50, 012908.

[25]

Campione S, Albani M, Capolino F. 2011 Complex modes and near-zero permittivity in 3D arrays of plasmonic nanoshells: loss compensation using gain. Optical Materials Express 1, 1077–1089.

[26]

Safian R, Mojahedi M, Sarris CD. 2007 Asymptotic description of wave propagation in an active Lorentzian medium. Physical review E 75, 066611.

[27]

Webb KJ, Thylén L. 2008 Perfect-lens-material condition from adjacent absorptive and gain resonances. Optics letters 33, 747–749.

[28]

Wuestner S, Pusch A, Tsakmakidis KL, Hamm JM, Hess O. 2011 Gain and plasmon dynamics in active negative-index metamaterials. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 369, 3525–3550.

[29]

King FW. 2009 Hilbert transforms vol. I–II. Cambridge University Press.

[30]

Zemanian AH. 1996 Realizability Theory for Continuous Linear Systems. New York: Dover Publications.

[31]

Beltrami EJ, Wohlers MR. 1966 Distributions and the boundary values of analytic functions. New York: Academic Press.

[32]

Milton GW, Eyre DJ, Mantese JV. 1997 Finite frequency range Kramers-Kronig relations: bounds on the dispersion. Phys. Rev. Lett. 79, 3062–3065.

[33]

Baratchart L, Leblond J. 1998 Hardy Approximation to ${\rm L}^{p}$ Functions on Subsets of the Circle with $1\leq p\leq\infty$ . Constructive approximation 14, 41–56.

[34]

Baratchart L, Leblond J, Seyfert F. 2009 Constrained extremal problems in the Hardy space H2 and Carleman’s formulas. arXiv preprint arXiv:0911.1441.

[35]

Kress R. 1999 Linear Integral Equations. Berlin Heidelberg: Springer-Verlag second edition.

[36]

Rudin W. 1987 Real and Complex Analysis. New York: McGraw-Hill.

[37]

Dahlquist G, Björck Å. 1974 Numerical methods. Englewood Cliffs, New Jersey: Prentice-Hall, Inc.

[38]

De Boor C. 2001 A practical guide to splines vol. 27Applied Mathematical Sciences. Springer-Verlag New York revised edition.

[39]

Dagnino C, Santi E. 1991 On the convergence of spline product quadratures for Cauchy principal value integrals. Journal of computational and applied mathematics 36, 181–187.

[40]

Boyd S, Vandenberghe L. 2004 Convex Optimization. Cambridge University Press.

[41]

[42]

Ivanenko Y. 2017 Estimation of electromagnetic material properties with application to high-voltage power cables. Licentiate thesis, Department of Physics and Electrical Engineering, Linnæus University, 351 95 Växjö, Sweden.

[43]

Nordebo S, Dalarsson M, Ivanenko Y, Sjöberg D, Bayford R. 2017 On the physical limitations for radio frequency absorption in gold nanoparticle suspensions. J. Phys. D: Appl. Phys. 50, 1–12.

[44]

Ivanenko Y, Nordebo S. 2016 Approximation of dielectric spectroscopy data with Herglotz functions on the real line and convex optimization. In 2016 International Conference on Electromagnetics in Advanced Applications (ICEAA) pp. 863–866.

[45]

Kristensson G. 2016 Scattering of Electromagnetic Waves by Obstacles. SciTech Publishing, Edison, NJ.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Nedic M, Ehrenborg C, Ivanenko Y, Ludvig-Osipov A, Nordebo S, Luger A, Jonsson BLG, Sjöberg D, Gustafsson M. 2019 In Herglotz functions and applications in electromagnetics ,. IET.
2[2] Zemanian AH. 1965 Distribution theory and transform analysis: an introduction to generalized functions, with applications . New York: Mc Graw-Hill.
3[3] Kac IS, Krein MG. 1974 R-functions - Analytic functions mapping the upper halfplane into itself. Am. Math. Soc. Transl. 103 , 1–18.
4[4] Akhiezer NI. 1965 The classical moment problem . Oliver and Boyd.
5[5] Nussenzveig HM. 1972 Causality and dispersion relations . London: Academic Press.
6[6] Bernland A, Luger A, Gustafsson M. 2011 Sum rules and constraints on passive systems. Journal of Physics A: Mathematical and Theoretical 44 , 145205.
7[7] Gesztesy F, Tsekanovskii E. 2000 On matrix-valued Herglotz functions. Math. Nachr. 218 , 61–138.
8[8] Youla D, Castriota L, Carlin H. 1959 Bounded real scattering matrices and the foundations of linear passive network theory. IRE Transactions on Circuit Theory 6 , 102–124.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Quasi-Herglotz functions and convex optimization

Abstract

1 Introduction

2 Quasi-Herglotz functions

2.1 Background

2.2 Basic properties

Definition 2.1

2.3 Integral representations

2.4 Boundary values

Theorem 2.2

3 Sum rules

Definition 3.1

Definition 3.2

Theorem 3.3

**Proof **

Remark 3.4

Remark 3.5

Remark 3.6

4 Approximation and optimization based on quasi-Herglotz functions

4.1 Approximation theory based on quasi-Herglotz functions

Definition 4.1

Remark 4.2

Definition 4.3

Remark 4.4

Theorem 4.5

**Proof **

Definition 4.6

Theorem 4.7

Corollary 4.8

4.2 Convex optimization with B-splines

5 Numerical examples

5.1 Passive approximation of a system with a given target response

5.2 Non-passive approximation of a system with a given target response

5.3 Optimization with point masses

5.4 Optimization with sum-rule constraints

6 Conclusions

Data access

References

Proof

Proof